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ABSTRACT 

Context. Recent results have revised upwards the total X-ray background (XRB) intensity below ~10 keV, therefore an accurate de- 
termination of the source counts is needed. There are also contradicting results on the clustering of X-ray selected sources. 
Aims. We have studied the X-ray source counts in four energy bands soft (0.5-2 keV), hard (2-10 keV), XID (0.5-4.5 keV) and 
ultra-hard (4.5-7.5 keV), to evaluate the contribution of sources at different fluxes to the X-ray background. We have also studied the 
angular clustering of X-ray sources in those bands. 

Methods. AXIS (An XMM-Newton International Survey) is a survey of 36 high Galactic latitude XMM-Newton observations cov- 
ering 4.8 deg- and containing 1433 serendipitous X-ray sources detected with 5-cr significance. This survey has similar depth to the 
XMM-Newton catalogues and can serve as a pathfinder to explore their possibilities. We have combined this survey with shallower 
and deeper surveys, and fitted the source counts with a Maximum Likelihood technique. Using only AXIS sources, we have studied 
the angular correlation using a novel robust technique. 

Results. Our source counts results are compatible with most previous samples in the soft, XID, ultra-hard and hard bands. We have 
improved on previous results in the latter band. The fractions of the XRB resolved in the surveys used in this work are 87%, 85%, 
60% and 25% in the soft, hard, XID and ultra-hard bands, respectively. Extrapolation of our source counts to zero flux are not enough 
to saturate the XRB intensity. Only galaxies and/or absorbed AGN may be able contribute the remaining unresolved XRB intensity. 
Our results are compatible, within the errors, with recent revisions of the XRB intensity in the soft and hard bands. The maximum 
fractional contribution to the XRB comes from fluxes within about a decade of the break in the source counts (~ 10 cgs), reaching 
~50% of the total in the soft and hard bands. Angular clustering (widely distributed over the sky and not confined to a few deep fields) 
is detected at 99-99.9% significance in the soft and XID bands, with no detection in the hard and ultra-hard band (probably due to the 
smaller number of sources). We cannot confirm the detection of significantly stronger clustering in the hard-spectrum hard sources. 
Conclusions. Medium depth surveys such as AXIS are essential to determine the evolution of the X-ray emission in the Universe 
below 10 keV. 

Key words. Surveys - X-rays: general - (Cosmology:) large-scale structure of the Universe 



1. Introduction 

The X-ray background (Giacconi et al. 1 19621 1 is a testimony 
of the history of accretion power in the Universe (Fabian 
& Iwasawa 1999, Soltan '1982"), and as such its sources and 
their evolution have attracted considerable attention (Fabian 
& Barcons I1992I ). Deep pencil-beam surveys (Loaring et al. 
120051 Bauer et al. |200?1 Cowie et al. ,2002i Giacconi et al. 
I2OO2I Hasinger et al. lOO]} have resolved most (^ 90%) of 
the soft (0.5-2 keV) XRB into individual sources. Most of these 
turn out to be unobscured and obscured Active Galactic Nuclei 
(AGN), while at the fainter fluxes a population of "normal" 
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galaxies starts to contribute significantly to the source counts 
(Hornschemeier et al. I2003I I. At higher energies the resolved 
fraction is smaller (Worsley et al. '2004): 50% between 4.5 and 
7.5 keV, and less than 50% between 7.5 keV and 10 keV. Models 
for the XRB based on the Unified Model for AGN reproduce 
successfully its spectrum with a combination of unobscured and 
obscured AGN, the later being the dominant population (Setti & 
Woltjer 1989, Gifli et al. I^OOTT ). R ecent h ard X-ray AGN lumi- 
nosity function results (Ueda et al.|20031 La Franca et al. 2005]l 
show that such simple models need considerable revision, with 
lower absorbed AGN fraction at higher luminosities and lumi- 
nosity dependent density evolution of the hard X-ray AGN lu- 
minosity. 
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The fainter sources from deep surveys do not contribute 
much to the final XRB intensity, and are often too faint opti- 
cally and in X-rays to be studied individually. Stacking of X-ray 
spectra is a valuable tool in these circumstances (Worsley et al. 
12006. Civano, Comastri & Brusa 2005 ), but only average proper- 
ties can be studied in this fashion, missing all the diversity in the 
nature of the sources necessary to account for the XRB. Wide 
shallow surveys over a large portion of the sky (Schwope et al. 
120001 Delia Ceca et al. 2004 Ueda et al. [2003i l allow detecting 
minority populations and studies of sources on an individual ba- 
sis, and serve as a framework against which to study and detect 
evolution, but again with only a minor contribution to the XRB. 

Medium depth wide-band surveys combining many sources 
with relatively wide sky coverage ( Hasinger et al. 120071 Pierre 
et al.l^OO? Barcons et al.'2002l, Baldi et al. "20021) have the poten- 
tial to provide many of the missing pieces of the XRB/AGN puz- 
zle, since most (30-50%) of the XRB below 10 keV comes from 
sources at intermediate fluxes (Fabian & Barcons , 1992. Barcons, 
Carrera & Ceballos I2006I I. and the optical and X-ray spectra 
of many sources can be studied individually (e.g. Mateos et al. 
[2005 ). Furthermore, extensive XMM-Newton source catalogues 
are available (IXMIvfl-SSC 2003 ■), and even larger ones will be 
available in the near future (2XMM,. . .), which, since they are 
constructed from serendipitous sources in typical public XMM- 
Newton observations, are themselves medium flux surveys. The 
exploitation of the full potential of these catalogues depends on 
the results from "conventional" smaller scale medium flux sur- 
veys such as those above and the one presented here. 

Since AGN are the dominant population at medium and low 
X-ray fluxes, and X-ray selected AGN are known to cluster 
strongly (Yan g et al . |20U6l GilH et al. 2005 MuUis et al. !2004l 
Carrera et al. I1998I I, it is interesting to investigate the angular 
clustering of X-ray sources, which can be performed without re- 
sorting to expensive ground-based spectroscopy. Angular corre- 
lation can be translated into spatial clustering via Limber's equa- 
tion (Peebles 1980), if the luminosity function of the popula- 
tions involved is known from other surveys. Several studies have 
investigated the angular clustering in the soft and hard bands 
in scales of tens-hundreds of arcsec to degrees, with varying 
success in the detection of signal. Angular clustering has been 
detected in the soft band (Gandhi et al. ,2006. Basilakos et al. 
120051 Akylas et al. l2000l Vikhlinin & Forman [1995]) wifli dif- 
ferent values of the clustering strength. Angular clustering in 
the hard band has evaded detection in some cases (Gandhi et 
al. ^ 006 Puccetti et al. 2006), but not in others (Yang et al. 
12003 Basilakos et al. 2004), in the latter cases finding clustering 
strengths marginally compatible with the observed spatial clus- 
tering of optical and X-ray AGN. 

We present here a medium X-ray survey which includes 
1433 serendipitous sources over 4.8 deg^: AXIS (An X-ray 
International Survey). This survey was undertaken under the aus- 
pices of the XMM-Newton Survey Science Centi-43 (SSC). In 
this paper we present the source catalogue (Section [2]), overall 
X-ray spectral properties (Section[3]l, number counts (Section[4| 

' The first XMM-Newton Serendipitous Source Catalogue (IXMM), 
released on 2003, contains source detections drawn from 585 XMM- 
Newton EPIC observations, and a total of ~30 000 individual X-ray 
sources. The median 0.2-12 keV flux of the catalogue sources is ~ 3 x 
10"''' cgs, with ~12% of them having fluxes below 10"''' cgs. 

^ The XMM-Newton Survey Science Centre is an international col- 
laboration involving a consortium of 10 institutions appointed by ESA 
to help the SOC in developing the software analysis system, to pipeline 
process all the XMM-Newton data, and to exploit the XMM-Newton 
serendipitous detections, see ht tp://xnmssc-www. star . le . ac.uk. 



where some other samples have also been used), fractional con- 
tribution to the XRB (Section [5), and angular clustering proper- 
ties (Section[6]). Overall results are summarised in Section[2] In 
a forthcoming paper (Barcons et al. 2007 ) we present the op- 
tical identifications of a subsample of AXIS, which we have 
called the XMM-Newton Medium Survey (XMS). In this paper 
we will use cgs as a shorthand for the cgs system units for the 
flux: erg cm"^ s"'. 

2. The X-ray data 

2.1. Selection of fields 

A total of 36 XMM-Newton observations were selected for opti- 
cal follow-up of X-ray sources within the AXIS programme (see 
Table [TJ, preferring those that were public early on (mid 2000), 
or belonging to the SSC Guaranteed Time. We selected fields 
with I b \> 20°, total exposure time >15 ks, and devoid of (op- 
tical and X-ray) bright or extended targets (except in two cases: 
A1837 and A399). Furthermore, after discarding the observing 
intervals with high background rates, a few of the fields ended 
up with exposure times shorter than 15 ks. A few fields (some 
with shorter exposure times) were only intended to expand the 
solid angle for bright X-ray sources. All of these fields have been 
used for the study of the cosmic variance, the angular correlation 
function and the logA^-log^ in different bands. 

Of those 36 XMM-Newton observations, 27 were selected 
for optical follow-up of medium flux X-ray sources. Two of 
those 27 fields (A2690 and MS2137) were later dropped from 
the main identification eff'ort, because their Declination was too 
low to observe them from Calar Alto (Spain) with airmass lower 
than 2. The sources in the remaining 25 fields were used to form 
flux-limited samples in the 0.5-2 keV, 2-10 keV, 0.5-4.5 keV 
band, and a non-flux-limited sample in the 4.5-7.5 keV bands: 
the XMS (see Barcons et al. 120071 ) . These 25 fields are marked 
in Table [1] 

2.2. Data processing and relation to 1XMf\/l 

The data used in our earlier follow-up efforts (see previous 
Section) had been reduced with very different versions of the 
SAS ( Science Analysis System, Gabriel et al. 120041) . The repro- 
cessing for the IXMM catalogue (SSC'2003) allowed us to have 
a much more homogeneous set of X-ray data. The Observation 
Data Files (ODF) were processed in the SSC Pipeline Processing 
System (PPS) facilities at Leicester with the same SAS ver- 
sion used for IXMM (very similar to SAS version 5.3.3, see 
below for the difference), except for Mkn205, which was pro- 
cessed with SAS version 5.3.3. PHL1092is a especial case, since 
the original XMM-Newton observation was never re-processed, 
instead we took a newer set of data from the XMM-Newton 
Science Archive, which was processed later with a different SAS 
version (5.4.0). 

The main difference between the versions of the SAS source 
detection task (emldetect) used for Mkn205 and the rest of the 
fields is the inclusion of the possibility of sources with negative 
count rates in the latter. Negative count rates in individual en- 
ergy bands were allowed to avoid a bias in the total count rates 
of sources that were undetected in one or more energy bands. 
However, since this option caused numerical problems in some 
cases, the count rates were limited to values > in later versions 
of the pipeline. Since negative count rates are meaningless, we 
have set all the negative count rates to zero in what follows. The 
coiTesponding detection likelihoods have also been set to zero. 



Table 1. Basic data for the 36 fields in the AXIS survey: including the target name, the XMM-Newton OBSJD number of the pn dataset used, the R.A. and Dec. of the field 
centre, the "clean" exposure time and the filter used for the pn camera, the Galactic column density in the direction of the field (Dickey & Lockman 1990), and, when applicable, 
the centre and radius of the circular target exclusion area, the width of the OOT exclusion area, and the centre and radius of an additional exclusion area around bright/extended 
sources. 



Target name 


OBSJD 


R.A. 


Dec 


Texp 


Filter 


A'H.Gal 


R.A. 


Dec. 


R 


OOT 


R.A. 


Dec. 


R' 






(J2000) 


(J2000) 


(s) 




(lO^" cm-2) 


(J2000) 


(J2000) 


(") 


(") 


(J2000) 


(J2000) 


(") 


A2690 


0125310101 


00:00:30.30 


-25:07:30.00 


21586.7 


Medium 


1.84 


00:00:21.2 


-25:08:12.18 


20 












C10016+1609'' 


0111000101 


00:18:33.00 


+ 16:26:18.00 


29149.3 


Medium 


4.07 


00:18:33.2 


+ 16:26:07.97 


148 












G133-69pos_2° 


0112650501 


01:04:00.00 


-06:42:00.00 


18080.0 


Thin 


5.19 



















G133-69Pos_r 


0112650401 


01:04:24.00 


-06:24:00.00 


20000.0 


Thin 


5.20 


- 


- 










- 





PHL1092 


0110890501 


01:39:56.00 


+06:19:21.00 


16180.0 


Medium 


4.12 


01:39:55.8 


+06:19:19.67 


88 


40 


01:40:09.0 


+06:23:27.67 


68 


SDS-lb"'' 


0112371001 


02:18:00.00 


-05:00:00.00 


43040.0 


Thin 


2.47 


- 


- 








- 


- 





SDS-3'''' 


0112371501 


02:18:48.00 


-04:39:00.00 


14927.9 


Thin 


2.54 


- 


- 








- 


- 





SDS-2'''' 


0112370301 


02:19:36.00 


-05:00:00.00 


40673.0 


Thin 


2.54 


- 


- 








- 


- 





A399'''' 


0112260201 


02:58:25.00 


+ 13:18:00.00 


14298.7 


Thin 


11.10 


02:57:50.2 


+ 13:03:20.88 


272 





- 


- 







0111220201 


06:15:36.30 


+71:02:04.90 


44506. 1 


Medium 


8.82 


06:15:36.6 


+71:02:15.95 


76 


32 


- 


- 





MS0737.9+744r'' 


0123100201 


07:44:04.50 


+74:33:49.50 


20209.3 


Thin 


3.51 


07:44:04.3 


+74:33:54.56 


120 


40 


- 


- 





S5 0836+71° 


0112620101 


08:41:24.00 


+70:53:40.70 


25057.3 


Medium 


2.98 


08:41:24.3 


+70:53:41.06 


160 


52 


- 


- 





PG0844+349 


0103660201 


08:47:42.30 


+34:45:04.90 


9783.5 


Medium 


3.28 


08:47:42.9 


+34:45:03.27 


200 


40 


- 


- 





€10939+4713° 


0106460101 


09:43:00.10 


+46:59:29.90 


43690.0 


Thin 


1.24 


09:43:01.8 


+46:59:44.37 


160 





- 


- 





B21028+3r 


0102040301 


10:30:59.10 


+31:02:56.00 


23236.0 


Thin 


1.94 


10:30:59.3 


+31:02:56.08 


140 


72 


- 


- 





621128+31" 


0102040201 


11:31:09.40 


+31:14:07.00 


13799.8 


Thin 


2.00 


11:31:09.6 


+31:14:06.02 


140 


44 


- 


- 





Mkn205'''' 


0124110101 


12:21:44.00 


+75:18:37.00 


17199.6 


Medium 


3.02 


12:21:43.8 


+75:18:39.08 


140 


36 


- 


- 





MS 1229.2+6430°'' 


0124900101 


12:31:32.32 


+64:14:21.00 


28700.0 


Thin 


1.98 


12:31:31.2 


+64:14:18.06 


140 


40 


- 


- 





HD111812* 


0008220201 


12:50:42.56 


+27:26:07.70 


37338.8 


Thick 


0.90 


12:51:42.6 


+27:32:23.27 


168 


40 


- 


- 





NGC4968 


0002940101 


13:07:06.10 


-23:40:43.00 


4898.7 


Medium 


9.14 


13:07:06.3 


-23:40:33.23 


40 





- 


- 





NGC5044 


0037950101 


13:15:24.10 


-16:23:06.00 


20030.0 


Medium 


5.03 


13:15:24.2 


-16:23:08.53 


340 





- 


- 





IC883 


0093640401 


13:20:35.51 


+34:08:20.50 


15849.4 


Medium 


0.99 


13:20:35.4 


+34:08:21.37 


48 





13:20:54.4 


+33:55:17.26 


104 


HDl 17555°* 


0100240201 


13:30:47.10 


+24:13:58.00 


33225.4 


Medium 


1.16 


13:30:47.8 


+24:13:51.07 


160 


40 


- 


- 





F278 


0061940101 


13:31:52.37 


+ 11:16:48.70 


4648.3 


Thin 


1.93 


13:31:52.4 


+ 11:16:43.88 


48 





- 


- 





A 1837° 


0109910101 


14:01:34.68 


-11:07:37.20 


45361.3 


Thin 


4.38 


14:01:36.5 


-11:07:43.14 


440 





- 


- 





UZLib° 


0100240801 


15:32:23.00 


-08:32:05.00 


23391.2 


Medium 


8.97 


15:32:23.4 


-08:32:05.32 


140 


40 









FieldVI 


0067340601 


16:07:13.50 


+08:04:42.00 


9634.0 


Medium 


4.00 



















PKS2126-15° 


0103060101 


21:29:12.20 


-15:38:41.00 


16150.0 


Medium 


5.00 


21:29:12.1 


-15:38:40.44 


120 


40 









PKS2135-14°'^ 


0092850201 


21:37:45.45 


-14:32:55.40 


28484.3 


Medium 


4.70 


21:37:45.1 


-14:32:55.22 


120 


44 









MS2137.3-2353 


0008830101 


21:40:15.00 


-23:39:41.00 


9880.0 


Thin 


3.50 


21:40:15.1 


-23:39:39.32 


140 


48 









PB5062° 


0012440301 


22:05:09.90 


-01:55:18.10 


28340.9 


Thin 


6.17 


22:05:10.3 


-01:55:20.38 


140 


40 









LBQS2212-1759° 


0106660101 


22:15:31.67 


-17:44:05.00 


90892.5 


Thin 


2.39 



















PHL5200° 


0100440101 


22:28:30.40 


-05:18:55.00 


43278.5 


Thick 


5.26 


22:28:30.4 


-05:18:53.12 


16 












IRAS22491-1808° 


0081340901 


22:51:49.49 


-17:52:23.20 


19867.2 


Medium 


2.71 


22:51:49.4 


-17:52:25.02 


32 












EQPeg" 


0112880301 


23:31:50.00 


+ 19:56:17.00 


12200.0 


Thick 


4.25 


23:31:52.7 


+ 19:56:18.46 


160 


48 









HD223460 


0100241001 


23:49:41.00 


+36:25:33.00 


6699.0 


Thick 


8.25 


23:49:40.8 


+36:25:32.56 


172 


40 


23:50:02.0 


+36:25:36.36 


116 



° Fields belonging to the XMS 

Fields excluded from the angular correlation studies 

eposcorr failed in these fields 

In common with SXDS (Ueda et al. l2006] | 

' In common with HELLAS2XMM (Baldi etal.l2002t 
f Same aiea as one ChaMP field (Kim et al. 12004b 
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meaning that the presence of that source in that band does not 
improve the fit (see below for details). 

The standard SAS products include X-ray source lists (cre- 
ated by emldetect) for each of the three EPIC cameras (MOSl, 
MOS2 -Turner et al. |2001T and pn -Struder et al. I2001f ). run in 
five independent bands (1: 0.2-0.5 keV, 2: 0.5-2.0 keV, 3: 2.0- 
4.5 keV, 4: 4.5-7.5 keV, and 5: 7.5-12.0 keV). In addition, the 
source detection algorithm was run in the 0.5-4.5 keV (band 9), 
which we will call the XID band. Since EPIC pn has the high- 
est sensitivity of the three cameras, we have used only pn source 
lists. We have not allowed for source extent when detecting the 
sources, and therefore all the sources in our survey are treated as 
pointlike. 

The internal calibration of the source positions from the SAS 
is quite good (1.5 arcsec, Ehle et al. '2005 ), but there may be 
some systematic differences between the absolute X-ray source 
positions and optical reference frames. We have registered the 
X-ray source positions to the USNO-A2 reference frame field by 
field, using the SAS task eposcorr. This task shifts the X-ray 
reference frame to minimise the differences between the X-ray 
source positions and the positions of their optical counterparts in 
a given reference astrometric catalogue (USNO-A2 in our case). 
These corrected positions are given in Table|2] The average shift 
in absolute value in R.A. (Dec.) was 1.3 (0.9) arcsec, and in all 
cases the shifts were under 3 arcsec. The average number of X- 
ray-USNO-A2 matches used to calculate the shifts was 28 (the 
minimum was 16 and the maximum 82). There were two fields 
for which this procedure failed (Mkn3 and A399), probably be- 
cause the first one had one quarter of the pn chips in count- 
ing mode, and the second has two moderately strong extended 
sources in opposite corners of the field. We therefore used the 
original X-ray source positions for these two fields. 

2.3. Source selection 

In each detection band, the source detection algorithm fits a por- 
tion of the image around the position of the candidate source 
trying to match the Point Spread Function (PSF) shape to the 
photon distribution. In this process it uses the background map, 
and the exposure map to fit a source count rate and 2D posi- 
tion. A typical size of this region is the 80% encircled energy 
radius, which corresponds to about 5 pixels on-axis and 7 pixels 
at 15 arcmin off-axis (we have used throughout 4 arcsec pixels). 
Sources closer than this distance to pn chip edges, could have 
a worse determination of their positions and/or count rates. We 
have therefore excluded all sources closer than the 80% encir- 
cled energy radius (taking into account the off-axis angle) to any 
pn chip edges. A region (~12 pixels wide) on the readout (outer) 
edge of the pn chips is masked out on board the satellite (Ehle et 
al. 12005b . We have considered this "effective" edge as the outer 
edge of each chip when defining our excluded regions. 

We have visually inspected all X-ray images, excluding cir- 
cular regions around bright targets, and other bright/extended 
sources in the images (see Table [T] for the sizes and positions 
of these regions). In a few images, a bright band extending from 
the target to the pn chip reading edge was visible, due to the pho- 
tons arriving at the detector while it was being read (called Out 
Of Time -OOT- region). We have excluded a rectangular region 
around the OOT region, whose width is also given in Table [T] In 
addition, sources affected by bright pixels/segments have been 
excluded, as well as those which were obviously affected by the 
presence of nearby bright sources, or split by the chip gaps, and 
were not picked up by the above procedures. 
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In addition, there were two sets of fields which partially over- 
lapped. G133-69pos^ and G133-69Pos_l on the one hand, and 
SDS-lb, SDS-2 and SDS-3 on the other We have dealt with this 
by masking out the portion of the second (and third) field which 
overlap with the first field, in the order given above. 

We have used the following bands in the analysis performed 
in this paper: 

- soft: 0.5-2 keV, identical to the standard SAS band 2 

- hard: combining standard SAS bands 3, 4 and 5, and hence 
corresponds to counts in the 2 to ~12 keV range. However, 
we have calculated (and will quote) all hard fluxes in 2-10 
keV. 

- XID: 0.5-4.5 keV, identical to the SAS band 9 

- ultra-hard: 4.5-7.5 keV, identical to the standard SAS band 4 

The detection likelihood in the hard band (L345) was obtained 
from the detection likelihoods in bands 3, 4 and 5 (L3, L4, and L5 
respectively) using L345 = -log(l -2(5/2, L^-hL^-i-L^)), where 
can be obtained from L, = -log(l - Q(3/2, Lp) and Q(a,x) 
is the incomplete gamma function (see manual for emldetect). 

There were a total of 2560 accepted sources with a 
emldetect detection likelihood > 10 (the default value) in at 
least one band, which are listed in Table |2] with their corrected 
X-ray positions, and count rates in the standard SAS and XID 
bands. 

The original unfiltered source lists are identical to the ones 
used by Mateos et al. (I2005I I in their study of the detailed spectral 
properties of medium flux X-ray sources, for the common fields. 
They also used similar criteria for excluding sources close to the 
pn chip edges, except for the readout edge. The differences in the 
final accepted source lists arose from several reasons: Mateos 
et al. used sources close to pn chip gaps (which we have ex- 
cluded) if they were far from chip gaps in the MOS detectors. 
They excluded from their spectral analysis the sources with low 
number of counts. Finally, we have treated slightly diff'erently 
the sources close to the exclusion zone boundaries. These diff'er- 
ences are at the 10% level: out of the final 1 137 accepted sources 
in the Mateos et al. sample, only 1 19 would have been excluded 
by our criteria. 

2.4. Sensitivity maps 

The value of the sensitivity map at a given point is the minimum 
count rate that a source should have to be detected with the de- 
sired likelihood at that point. As explained above, the detection 
likelihood assigned by emldetect takes into account the num- 
ber of counts in the detection box, how well do they fit the PSF 
shape, and the variation of the exposure map over the detection 
box. The likelihood is therefore in principle not trivially related 
to the Poisson probability of an excess in the number of detected 
counts over the expected background in the detection box. 

However, we have found (see Appendix |A]i that the count 
rate assigned by the software to the source (the "observed" count 
rate) is proportional to the count rate expected from a Poisson 
distribution for the same likelihood (for detection likelihoods 
between 8 and 20), with proportionality constant ~ 0.9 - 1.1, 
depending on the band. It appears that the non-Poisson charac- 
teristics taken into account by emldetect have a relatively small 
influence in the determination of the count rate. For a given like- 
lihood, radius of the detection region, total value of the back- 
ground map, and average value of the exposure map in that re- 
gion, we calculate the expected Poisson count rate at each point 
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of the detector and, using the proportionality constants, the cor- 
responding "observed" count rate, i.e., the value of the sensitiv- 
ity map (see AppendixlAlfor further details). 

We have used the above procedure to create sensitivity maps 
for each field in each band, in image {X, Y) coordinates, which 
have 4 arcsec pixels, have sky North to the top and East to the 
left, and are centred approximately in the optical axis of the X- 
ray telescope. We have also taken into account for each field 
the areas excluded close to the detector edges, and around the 
bright sources and OOT regions, excluding those regions from 
the sensitivity maps as well. 

The final selection of sources in each band was done using 
their detection likelihood and the corresponding sensitivity map 
at their sky position (to ensure the validity of the sky areas calcu- 
lated from the sensitivity maps, see Section |4Tt . We have chosen 
a detection confidence limit of 5-cr, which corresponds to L = 15 
in the band under consideration. We have also imposed that the 
source has a count rate equal to or larger than the value of the 
sensitivity map at their sky position, to ensure that the source 
detection is reliable (this excluded less than 5% of the L > 15 
sources in the soft band, ~10% of the sources in the XID and 
ultra-hard bands, and ~20% of the sources in the hard band). 
The number of sources excluded by this criterion is much larger 
in the hard band than in the other bands. This is somewhat con- 
tradictory with the proportionality between the detected count 
rate and the Poisson count rate for this band being in a differ- 
ent direction than in the other bands (~10% smaller rather than 
~10% larger, see Table I A. ll , since the sensitivity map will then 
be relatively lower, and therefore would tend to include more 
sources, rather than exclude them. In any case, the hard band is 
the widest, and the only one of our bands which is not one of 
the bands used for source searching in the SAS, being instead 
a composite of three default bands. In principle, all this makes 
the hard band the more complex to deal with, and for which the 
uncertainties associated with our empirical method to calculate 
the sensitivity maps might be the highest. 

The total number of distinct selected sources (i.e., fulfilling 
the above criteria in at least one of the bands) is 1433. We give in 
Table|4]the total number of selected sources in each of the above 
bands. Unless explicitly stated otherwise, for the logA^-logS' and 
the angular clustering analyses below we have only used sources 
detected in the corresponding bands. However, it is important to 
emphasise that the fact that, if a source has been detected in at 
least one band, it can be considered as a real source, and there- 
fore its counts in other bands can be used (e.g., for spectral fitting 
or hardness ratio calculations), even if the source has a count rate 
smaller than the detection threshold in those bands. 

To estimate the number of spurious sources in our survey 
we first need to calculate the number of independent source de- 
tection cells, emldetect uses an input list from eboxdetect 
(which is a simple sliding-box algorithm which uses a square 
5x5 pixels detection cell), so the individual detection cell is 
5 X 5 X (4")^ - 400 arcsec^. The total area of our survey is 
~4.8 deg^ (see Section|4]TJ, or about 155520 independent detec- 
tion cells. Since the probability of a false detection at the 5-cr 
level is 0.000057, this corresponds to about 9 spurious sources 
in each of our detection bands. This is less than 1% in the soft 
and XID bands, about 2% in the hard band, and almost 10% in 
the ultra-hard band. The latter fraction could explain in part the 
discrepant point in the ultra-hard logA^-logS' at the lowest fluxes 
(see Section 14.5b . since it is there where the contribution from 
spurious sources is expected to be highest. 



3. X-ray properties of the sources 

We have studied the broad spectral characteristics of our sources 
by fitting their count rates in several bands, to those expected 
from power-law spectra with photon index F and Galactic 
Hydrogen column densities (Nh) from 21cm radio measure- 
ments (Dickey & Lockman 1990 see Table [TJ. The spectra of a 
few sources will obviously not be well represented by a power- 
law (e.g. stars or clusters showing thermal spectra), but for the 
purpose of calculating fluxes in bands in which the power-law 
was fit, a power-law is a reasonably accurate and very simple 
approximation. 

The expected count rates for diff'erent values of T (from -10 
to 10 in steps of 0.5, interpolating linearly for intermediate val- 
ues), and the Galactic Nh values of each field were calculated 
with xspec (ArnaudI 19961 1, using the "canned" on-axis redis- 
tribution matrix files, and on-axis effective areas for each field, 
created with the SAS task arfgen. The inaccuracies from using 
standard response matrices instead of source specific matrices 
(as generated by rmfgen) are expected to be small, since we 
only use broad bands, much broader than the spectral resolution 
of the EPIC pn camera. Since the count rates from emldetect 
are corrected for the exposure map (which includes vignetting, 
and bad pixel corrections) and the PSF enclosed energy fraction, 
the effective areas were generated disabling the vignetting and 
PSF corrections as indicated in the arfgen manual. 

The corresponding fluxes in the bands 2 to 5 (in the case of 
band 5, the flux was calculated in the 7.5 to 10 keV band) were 
also calculated using the spectral model for the same values of F, 
setting Nh - 0. We have checked that the relatively coarse sam- 
pling does not introduce any biases in our spectral fits by repeat- 
ing the spectral fits for one field with a step in F of 0.001. The 
results were practically identical, any differences in the spectral 
slopes being much smaller than their uncertainties. 

We have performed spectral fits in bands 2 and 3 (for the soft 
and XID fluxes), and bands 3, 4 and 5 (for the hard and ultra- 
hard fluxes). The average count rates of our selected sources in 
the XID and hard bands are 0.0095 and 0.0032, respectively, 
which for a typical exposure time of 15 ks, give an average of 
more than 10 counts per bin, which ensures that Gaussian statis- 
tics is a good approximation. However, they are not sufficient 
in many cases to warrant a detailed spectral fitting with xspec. 
The best fit F and flux were calculated by minimising the be- 
tween the observed and expected count rates. The minimisation 
was actually done in F, setting the flux from the normalisation 
that minimised the x' in the corresponding band. One sigma er- 
ror bars for the photon index and flux were obtained from the 
values which produced Ax^ - 1 from the minimum. These error 
bars were asymmetric in most cases, but we have used a sym- 
metric error bar (the arithmetic average of the upper and lower 
error bars) for the weighted averages of the spectral slopes and 
the logA^-log5 . In a few cases the fitted photon indices are very 
steep I F |~ 10, in all cases this corresponds to sources with posi- 
tive count rates in only one of the fitted bands, forcing the power 
law to the steepest allowed slope. The number of these patho- 
logical cases in each band can be obtained from the difference 
between the and A^ave columns in Table |4] (typically < 10%). 

The photon indices and fluxes in each of the fitted bands, 
as well as their error bars are given for each source in Table 
|3] We show in Table |4] the weighted average photon indices of 
the detected sources in each of the fitted bands, as well as the 
number of sources used in the averages, excluding in all cases 
sources with | F |> 9, to avoid biasing the averages with a few 
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Table 4. Number of sources selected in different bands A^, weighted average slopes (F) and errors (taking into account both the 
error bars in the individual F and the dispersion around the mean), and number of observed sources used in the average A^ave- The 
soft, hard, XID and ultra-hard bands are as defined in the text. "Soft and hard" refers to sources selected simultaneously in the soft 
and hard bands, "Only soft" refers to sources selected in the soft band but not in the hard band, and "Only hard" refers to sources 
selected in the hard band and not in the soft band 



0.5-2 keV 2-10 keV 



Selection 


N 


N 

-"ave 


<r> 


^ ' ave 


<r) 


Soft 


1267 


1239 


1.811 ±0.015 


1145 


1.53 ±0.02 


Hard 


397 


397 


1.76 ±0.03 


394 


1.55 ±0.03 


XID 


1359 


1335 


1.773 ±0.016 


1244 


1.47 ±0.02 


Ultra-hard 


91 


91 


1.80 ±0.06 


91 


1.50 ±0.07 


Soft and hard 


345 


345 


1.79 ±0.03 


342 


1.67 ±0.03 


Only soft 


922 


894 


1.878 ±0.017 


803 


1.04 ±0.04 


Only hard 


52 


52 


-0.29 ± 0.08 


52 


0.53 ±0.12 



outliers. The impact from these sources would be small for the 
weighted averages used here. 

We have checked the reliability of our simple spectral fit 
method (see Appendix |B]i, performing both internal tests with 
simulations, and external checks with respect to the full spec- 
tral fits of Mateos et al. (I2005I I to a set of sources with a large 
overlap. We conclude that our 2-3 band spectral slopes are good 
when taken individually for the purposes of calculating fluxes 
in different bands. However, we have found that there are sig- 
nificant systematic biases in the average photon indices (when 
averaged over large samples) which are larger than the statistical 
errors. These systematic biases are nonetheless small in both ab- 
solute (AF ~ 0.12) and relative (AF/F = 0.2) terms. Our method 
is therefore adequate for extracting broad spectral information 
from medium flux X-ray sources, such as those in the IXMM 
and 2XMM catalogues. 

In broad terms, the soft/XID band slopes are ~ 1 .8 for sources 
detected in all bands, being slightly softer for sources detected 
only in the soft band, and much harder for sources detected only 
in the hard band (see Table |4]i. The hard band slopes are flatter, 
closer to 1.5-1.6, but slightly steeper for sources detected both 
in the soft and hard bands (~1.7). The average hard slope of the 
sources only detected in the soft band is quite flat (F) ~ 1, but 
their un-weighted average is (F) - 1 .53+0.08, much closer to the 
other average slopes in that band. The origin of this difference 
is that "Only soft" sources with steep spectra in the hard band 
tend to have larger errors on the hard spectral slope (because 
they have few counts in XMM-Newton bands 4 and 5, and so 
the slope is not well constrained), and therefore they have a very 
small weight in the weighted average. 

The difference between the spectral slopes of sources only 
detected in in the soft band and those only detected in the hard 
band is partly due to a known bias in likelihood limited surveys 
(such as ours), that occurs because source detection is done in 
photons rather than in flux (Zamorani et al. il988i Delia Ceca et 
al. 119991 ), so that for a given flux, softer sources will be much 
easier to detect in the soft band, while the inverse would be true 
for harder sources. This is compounded by the fact that XMM- 
Newton is much more sensitive to soft photons. 



4. The logiV-log5 

We have first studied the sky density of the detected sources as a 
function of flux (known as the logA^-logS' ) in the soft, hard, XID 
and ultra-hard bands. 




Flux (cgs) 



Fig. 1. Sky area as a function of flux for different bands 



4.1. Sky areas 

The sky area over which we are sensitive to a given flux is easy 
to calculate from the sensitivity maps. In principle, we just need 
to sum the sky area over which the sensitivity maps have values 
below the desired flux. However, the conversion between flux 
and count rate depends on the assumed spectral shape. In each 
band /, we have calculated the sky areas at each flux for each 
assumed spectral slope f2,(5, F) with the following procedure: in 
each field, we have converted S to count rate using F and the re- 
sponse matrices, and summed the area over which the sensitivity 
map is below that count rate. The total sky area is the sum of the 
areas found for each field. 

The sky area in each band for each source j is then Oy , = 
Q.i(S jj,Tji), where the source's flux S jj and spectral slope F^, 
are described in Section[3] 

Sky areas independent of any assumed spectrum can also 
be obtained by weighting 0,(5',F) with the number of detected 
sources in each (S, F) bin. We will call these spectrally averaged 
sky areas f2,(5). They are shown in Fig.[T] The maximum value 
of all curves is 4.8 deg^, which is the geometric area covered by 
our survey, taking into account the excluded areas. 

We have also obtained spectrally averaged sky areas for each 
field Q,;i;(5') (k = I ...number of fields) from the sensitivity 
maps for each field k in each band /, using S (F) for that field, 
and weighting with the number of sources in the total sample 
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with the corresponding 5',r, assuming imphcitly that all fields 
have the same S (F) distribution. 



4.2. Data from other surveys 

Our survey consists of XMM-Newton exposures with a typical 
exposure time of about 15 ks, and it is hence a medium survey. 
Shallower wider area surveys are required to obtain significant 
numbers of bright sources, while deeper pencil-beam surveys 
will probe fainter fluxes. We have combined our survey with 
both shallower and deeper surveys to obtain a wide coverage 
in flux (~2-4 orders of magnitude): 



- BSS: DeUa Ceca et al. (2004) have constructed a bright sam- 
ple of XMM-Newton sources down to a flux of 7 x 10"'^ cgs 
in the XID band, with a uniform coverage of 28. 1 deg^ above 
that flux. This sample is ideally complementary to our XID, 
being both wider and shallower, and selected exactly in the 
same band with the same observatory (but with a different 
detector: MOS2 instead of pn). We have used a total of 389 
sources from that survey, including all sources identified as 
stars, which have been excluded from the logA^-log5 analy- 
sis of Delia Ceca et al. (2004) 

- HBS: In the same paper, Delia Ceca et al. (2004) also de- 
fine a sample of sources detected in the ultra-hard band, 
down to the same flux limit, and with a uniform coverage 
of 25. 17 deg^, again both shallower and wider than ours and 
with the same observatory. Results from a subsample are pre- 
sented by Caccianiga et al. (120041) . The source counts are 
also discussed by Delia Ceca et al. (120041) and compared to 
previous results from BeppoSAX. We have used a total of 65 
sources from this survey 

- CDF: The deepest survey in the soft and hard bands so 
far is the Chandra Deep Field North (CDF-N) Bauer et al. 
(120041 ). obtained from a total exposure of 2 Ms in a loca- 
tion in the Northern hemisphere. In a complementary effort 
in the South, the Chandra Deep Field South (Giacconi et al. 
12001 Rosati et al. 2002) gathered a total of 1 Ms. Source 
counts from both samples have been discussed in Bauer et al. 
(12004 ). There are a total of 442 soft and 313 hard sources in 
tiie CDF-N, and 282 soft and 186 hard sources in the CDF-S. 
Recent internal Chandra calibrations have increased the esti- 
mate of the ACIS eff'ective area above 2 keV. We have used 
the applications on the Chandra calibration database to com- 
pare data from Cycles 8 and 5: above 2 keV the ratio between 
the eff'ective areas is reasonably flat, and well approximated 
by a constant increase factor of about 12%. The increase in 
the 2-8 keV fluxes is unlikely to be due to the increasing 
contamination from the optical blocking filter, since the lat- 
ter only manifests itself below 1 keV. Therefore, all CDF- 
N and CDF-S hard fluxes (and their corresponding errors) 
have been decreased by this factor. Furthermore, when com- 
paring the sky areas calculated individually for each source 
with those calculated interpolating from the sky area of the 
full survey, the faintest soft sources in both CDF surveys 
showed large (up to several orders of magnitude) discrepan- 
cies. Since our maximum likelihood logA^-log5 fit method 
requires the use of a model for the sky area of the full survey 
(see Section 1431 ). we have not used soft sources fainter than 
3 and 7x10 cgs in the CDF-N and CDF-S respectively in 
the logA^-log5 fit. 

- AMSS: The ASCA Medium Sensitivity Survey (Ueda et al. 
120051 ) is one of the largest high Galactic latitude broad-band 



X-ray surveys to date, including 606 sources over 278 deg^ 
with hard band fluxes between ~ 10"'"^ and ~ 10"" cgs. 

The average sky area as a function of flux for different bands 
for the CD F and AMSS are shown in Bauer et al. (2004) and 
Ueda et al. (120051 ). respectively. We obtained them from the re- 
spective first authors, and we have used them for the Maximum 
Likelihood fit (see Section l43T ). The Chandra re-calibration has 
also been applied to the CDF effective area fluxes. 



4.3. Construction of ttie binned logN-logS 

The binned diff'erential logA^-log5 (number of sources per unit 
flux and unit sky area at a given flux S: dN/dSdQ.) have been 
constructed summing the inverse of the Q^, for the sources in 
each flux bin, and dividing that number by the width of the bin. 
The errors are calculated dividing the dN/dS dQ. by the square 
root of the sources in each bin. Since we have chosen to have 
a minimum number of sources per bin (see below), their widths 
are all in principle different, determined by the flux of the first 
source in the bin and the flux of the first source in the next bin 
going up in flux. With this definition, the width of the last bin 
was left undefined. Since we have a large number of sources, 
we have dropped the brightest source in each sample (only when 
dealing with binned logA^-logS' ), defining the upper limit in the 
last bin to be the flux of this brightest source. 

The integral logA^-logS' (number of sources per unit sky area 
with fluxes higher than S : N{> S )) from our sample and other 
samples are shown in Fig. |2] We have chosen to plot the inte- 
gral logA^-log5 in bins containing 15 sources each (except the 
last one, which can contain up to 29 sources), to avoid appar- 
ent features due to fluctuations of a few sources, specially at 
the brighter ends of each sample, were the number of sources 
is low. We have simply added the inverse of the sky areas for 
each source Q^ ,, for all sources with fluxes above the lower limit 
of each bin. The eiTor bars are calculated dividing the A^(> 5 ) by 
the square root of the total number of sources with flux equal or 
greater than the lower limit of the bin. 

Sources just below the faint flux limit of a survey could expe- 
rience statistical fluctuations of their fluxes, promoting them into 
the survey, the fainter sources just above that limit could drop 
from the survey for the same reason, but since fainter sources 
are much more abundant (because the source counts are steep), 
the net effect is to increase artificially the observed source counts 
close to the fainter survey limits. This is known as the Eddington 
bias (Eddington |1913l ), and sometimes causes a re-steepening of 
the logA^-log5 at the faintest fluxes, as observed in the AXIS 
ultra-hard source counts, for example. 



4.4. logN-logS model 



Previous X-ray source count results (e.g. Bauer et al. 120041 Ueda 
et al. 2003 Moretti et al. |2003l Baldi et al. |200?, Hasinger et 
al. 1998 Cagnoni, Delia Ceca & Maccacaro |l998i ) have shown 
that the logA^-log5 is well approximated by a steep power law at 
bright fluxes, flattening at lower fluxes. We have hence adopted 
the following model for the differential logA^-log5 : 



dN 
dSdO. 




,S <St 
,S >Sb 



The above model has four independent parameters: the break 
flux S h, the normalisation K, the slope at high fluxes r„, and the 
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Table 5. Maximum likelihood fit results to the logA^-log5 in different bands and using different samples: the first column is the band 
used, the second is the power-law slope above the flux break, the third is the slope below that break, the fourth is the flux break, the 
fifth is the normalisation, the last six columns indicate the number of sources from each sample used in the fit. 



A'used/A'lot 



Band 


r„ 




(10-"' cgs) 


K 

(deg-2) 


AXIS 


BSS 


HBS 


CDF-N 


CDF-S 


AMSS 


softs 

soft 

soft^ 


2 4Q+0.06 

7 aQ+H;8^ 
-0 06 
9 ag+0.14 
^•-'°-0.05l 


J 74+0.06 
1 69+88? 
^■-'"-o.oi 


1 15+°'** 

* -0 n 
1 02+°'* 


120.0+22 
123.4+]*-? 
141.4+«3' 


1267/1267 
1267/1267 
1267/1267 






429/442" 


269/282* 




hards 

hard 

hard 

hard 

hard 

hard 

hard 

hard 

hard-f 


2 74+0.08 

9 79+8:8? 

-Q OS 

2 03+012 

Z,.UO_(, [[ 
9 11+0.50 

2 12 

T ^/;+8o8 

9 5O+0.02 
-^°-0.02 
9 CO +0.25 

-Q-I8 
9 SQ+0.I7 


1 00+"" 
0.89+"'^ 

1 10+8:"^^ 

-!!Q1 
1 on+OOl 

1 ic+oi't 

-««? 


1.00 
1.00 

30+° '" 

"■-'"-0.05 

0.72+°-" 

Q44;8:A3 

1 nn+0 u8 

1.00 

92+°-^^ 

1 i7+8:A? 

^•^ '-0.05 


735 +'2 

684.4:«-' 

1086.6!]1« 
743 7+109.6 

r 1 1 c+49 1 

611.5+*^j 
606.5!J«J 
607.8!^|;^:t 
485.3!^::^ 


348/397' 
348/397' 

397/397 
348/397' 

397/397 






313/313 

313/313 
313/313 

313/313 
313/313 


186/186 
186/186 
186/186 

186/186 
186/186 


606/606 
606/606 
606/606 


XIDs 

XID 

XID^ 


9 ^Q+0.05 
^•-"-0.20 

2 45+0.11 
9 ';d+o.03 

^•■^ -0.04 


1 Q7+0.09 
^■■-"-0.32 
J 29+0 09 

1 oi;+8:(')i 

-0.25 


imi°% 

1 4«:+0.16 
64+8^^ 

^•"^-0.28 


265 !f 14 

919 0+47.4 

193.0!]|^ 


1359/1359 
1359/1359 
1359/1359 


389/389 










ultra-hards 

ultra-hard 

ultra-hard'^ 


2.63+°'^ 

2 gcj+BlA^ 

2 62+8:''^ 

^•"■^-0.10 




1.00 
1.00 
1.00 


102 +" 
95-0+11^ 
102.2!2°-o 


84/89'' 
84/89'' 
84/89'' 




58/65'- 









" Sn,i,,CDF-N = 3X10-'' cgs 

* Snjin^DF-S = 6X 10-'^ CgS 
5min.AXIS = 1-5 X 10-'" CgS 

'' 5' min.AXIS = 10 '" CgS 

' 5„,„.HBs =2.5x lO-'^cgs 

^ Best fit used for contribution to XRB, and in Figs.|2]and|3] 

s Using fixed photon indices of 1.8 in the soft and XID bands, and 1.7 in the hard and ultra-hard bands. 



slope at low fluxes Yd- If the change in the slope of the logA^- 
log5 is not significant, we fixed Sb = IQ-''" cgs and F,, = F^, 
leaving only two independent variables: K and r„. 
The integral logAf-log5is therefore: 



m>s) 



dS- 



dN 
dSdD. 



{S) 



Finally, assuming a given logA^-logS' , the number of sources 
with fluxes between S'niin and 5 max is 



^('^min < 5 < 5 max) 



This number is the total expected number of sources in our 
survey A with fluxes in band / in that interval if Q(5) = 
defined above, or it can be Ak the expected number of sources in 
that flux interval and band in a given field k if Q(5) - il, 

The contribution to the intensity of the XRB from the interval 

(•^ min; max) IS 



^('5 mill S < 5 max) 



Js„, 



(1) 



of the sources, as well as the changing shape of the sky areas 
with flux. 

Specifically, we have minimised the following expression: 



^tot — ^ j ^sample 
sample 



where Lsampie is given by 



isample ^ '^^j ^OS(P(S j)) " 2 log(P^(N)) 

./=I 



(2) 



where the sum is over all the A' sources in the corresponding 
sample, and P{S j) is the probability of finding a source of flux 
S i in that sample (see below). The second term is PAiN) = 
e^'^A'^ IN\ the Poisson probability of finding A^ sources in that 
sample when the expected number is A (see Section |4~4] i. 
We have defined P{S j) as 



4.5. Maximum likelihood fit method and results 

We have fitted the logA^-logS' using a Maximum Likelihood 
method which takes into account the uncertainties on the fluxes 



This expression takes into account both the variation of the 
sky area with flux (through Q(5')), and the uncertainty in the 
flux of the source crj (Page et al. 120001) . through the last ex- 
ponential term in the numerator, in which we have assumed a 
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Gaussian distribution of the fluxes around the measured value. 
To speed up the numerical calculation of the integral in the nu- 
merator above, we have defined S ^^j^^ = max(5 j - Actj, S min) and 
'^max - min(5; + 40"^, 5max), since the tails of the Gaussian dis- 
tribution decrease very quickly. The normalisation of the logA^- 
logS (K) appears both in the numerator and the denominator of 
P{S j) and it is unconstrained. This is why we have introduced 
the second term in Eq.|2l 

The 1-cr uncertainties in the logA^-log5 parameters are es- 
timated from the range of each parameter around the minimum 
which makes ALtot = 1 ■ For each parameter, this is done by fix- 
ing the parameter of interest to a value close to the best fit value, 
and varying the rest of the parameters until a new minimum for 
the likelihood is found, this is repeated for several values of the 
parameter until this new minimum equals Ltot.min + 1 ■ 

The results of the Maximum Likelihood fits to various (sin- 
gle and combined) samples and bands are given in Table |5] 
Except when stated otherwise, the flux interval used in the fit 
is Sjnin = 10"'^ cgs and Smax = 10"'^ cgs. These numbers have 
been chosen to span the observed fluxes of the sources. The final 
results are not very dependent on them, since the sky area falls 
very quickly at low fluxes, and the sky density of bright sources 
is very small. The initial values for the numerical search for the 
best fit have been obtained from a fit to the total binned dif- 
ferential logA^-logS' . The best overall fits are marked in the first 
column of Table|5]and also shown in Fig.|2l 



The first two rows in Table |5]for each band allow comparing 
the results of the fit to the AXIS sources using a fixed spectral 
slope to calculate fluxes and sky areas (T - 1.8 in the soft and 
XID bands, F = 1 .7 in the hard and ultra-hard bands, first row), 
to the results using the best fit spectral slope for each source 
(second row). The results are mutually compatible in all bands, 
but the error bars are noticeably larger in the XID and ultra-hard 
band when using a fixed spectral slope. Since by fitting the spec- 
tra of the sources we are "forcing" them to have a power law 
spectral shape, the uncertainties in the fitted fluxes are smaller 
than those in the fluxes with fixed spectral slopes. This might 
explain at least in part the smaller uncertainties in the logA^- 
logS fitted parameters in the latter case. In what follows, we have 
hence used the slopes from spectral fits to calculate fluxes from 
count rates for all the AXIS sources, since this approach seems 
to produce smaller uncertainties in the fitted parameters. 

There is general agreement between data and model fits, but 
it is difficult to quantify this statement, since, unlike the;^'^ statis- 
tic, the absolute value of Ltot is not an indicator of the goodness 
of the fit. Each panel in Fig. |2] covers several orders of magni- 
tude, and hence a detailed visual comparison is also difficult. We 
have therefore plotted in Fig. [3] the ratio between the binned dif- 
ferential logA^-log5 and the best fit model. Systematic deviations 
from unity in those plots would reveal differences between the 
data and the best fit model. In that Figure we also show the 1- 
cr uncertainty interval on the best fit, estimated in a conservative 
way: for each flux, we have calculated the differential logA^-log5 
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Fig. 3. Ratio between the binned differential logA^-log5 and the best fit model in different bands. The 1-cr uncertainty interval on 
the logA^-logS' is also shown as bow-tie-shaped dashed lines above and below unity. 



in the 16 corners of the hypercube defined by the 1-cr uncer- 
tainty intervals in the best fit logA^-log5 parameters, and taken 
the maximum and minimum values. 

As expected, the relative agreement of the AXIS and 
BSS/HBS source counts is quite good, merging with each other 
well, and following the same logA^-log5 shape. This confirms 
the good relative calibration of the two EPIC cameras on board 
XMM-Newton (pn for AXIS and MOS2 for BSS/HBS). The 
XMM-COSMOS logA^-log5 results (Cappelluti et al. ,2007i) . are 
also consistent with ours within 1 to 2-cr in the soft and hard 
bands, but the uncertainties in our best fit parameters are smaller, 
probably due to our much higher number of sources and wider 
flux coverage in those bands. 

The agreement with the CDF samples is very good. In the 
soft band, our joint AXIS-CDF fit is in excellent agreement with 
the CDF results of Bauer et al. ( |2004b (Tj = 1.55 + 0.03), as 
expected since virtually all sources at low fluxes are the same in 
both samples. Our AXIS-only fit prefers a steeper slope below 
the break, but still compatible with the joint AXIS-CDF fit at < 
2cr. The agreement with Moretti et al. (2003J is very good below 
the break (Fj = 1 .60^g g3, hereafter we have translated all inte- 
gral logA^-logS' slopes to our differential slopes), but their break 
happens at higher fluxes (Si, = (1.5 + 0.3) x lO"''* cgs), result- 
ing in a steeper slope at high fluxes (Fj, = 2.82;^y yg). Our results 
are fully compatible with those of the HELLAS2XMM (Baldi et 



al. 120021 ) sample at their low flux end, but our source counts are 
systematically at their lower envelope above the break. 

In the hard band, the CDF, AXIS and AMSS logA^-log5 
match well in their overlapping regions. However, the ratio in 
the top-right panel of Fig.[3]is rather wiggly and shows clear dif- 
ferences in detail. The AXIS data are mostly below the model, 
perhaps because (see Section 12.41 ) the high fraction of sources 
excluded for being fainter than the sensitivity map, and the rel- 
atively lower value of the sensitivity map with respect to the 
Poisson count rate (which would in turn increase the effective 
area). There are two "bumps" with the data systematically above 
the model at about 3 x 10"'^ and 3 x 10"^^ cgs (although in this 
last one the discrepancy is well inside the uncertainties on the 
logA^-log5 fit). These bumps probably arise from fitting a bro- 
ken power-law with a sharp break to the logA^-log5 , which is 
really a smooth distribution coming from the superposition of 
different populations at different redshifts. This is compounded 
by possible calibration uncertainties: the hard band calibration 
of Chandra has changed by 12% in the last 3 years (see Section 
EJ). 

A break at 3 x 10"'^ cgs is actually present in the joint fit 
to the CDF-N and CDF-S hai'd (all-CDF) samples (see Table|5]), 
with the low flux slope compatible with the value from Bauer et 
al. 2004 (Frf = 1.56 ± 0.14) within about 1-cr, and the high flux 
slope flatter than in the global fit. The low flux slopes for the two 
single CDF samples are much flatter than and incompatible with 
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any previously reported values (~ 1). We believe that the change 
in the slope of the source counts between AXIS and the CDF 
forces the global break flux to the overlapping fluxes between 
those samples (~ 10"''* cgs), and hence the fit below this flux 
somehow averages the two slopes present in the CDF, forming 
a bump around the CDF break, rising and dropping above and 
below this flux, respectively. The overall behaviour of the logA^- 
logS can be obtained by a fit to the CDF and AMSS data alone, 
and, despite the lower AXIS source counts, the best fit values 
between the CDF+AMSS and the CDF+AMSS+AXIS are mu- 
tually compatible within <l-cr, the main eff'ect being pushing 
the break flux to higher fluxes and steepening the low flux slope. 
Additionally, the uncertainties in all parameters are significantly 
reduced by the inclusion of the AXIS data, by factors between 
about 2 and 20. 

The best fit curve by Moretti et al. (I2003I I is again very sim- 
ilar to our all-sample fit at the highest (r„ = 2.57^QQg) and low- 
est fluxes (Frf - 1.44^^-^^), where it is however above the CDF 
data (perhaps because of the change in the hard band calibra- 
tion of Chandra). Their transition flux occurs at lower fluxes 
(Sb = (4.5![^) X 10"'^ cgs), compatible with the CDF-N and 
all-CDF fits, perhaps because their functional form is smoother 
than a broken power-law and cannot accommodate the relatively 
strong break between the AMSS/AXIS steep slope and the CDF. 
The HELLAS2XMM confidence interval overlaps with ours, but 
has a flatter slope, and falls clearly below both the AXIS and the 
CDF points below about 2 x lO"''* cgs, perhaps suggesting in- 
completeness at their lower fluxes. 

Our fit to the joint AXIS-BSS source counts in the XID 
band looks visually quite good, passing through the middle of 
the AXIS and BSS data points (see Fig. |2|. The XI D band 
source counts from the BSS (Delia Ceca et al. 120041 ) require 
a steeper slope (F,, = 2.80 ± 0.11) than both our AXIS-only 
and AXIS-i-BSS fits. The origin of this apparent discrepancy is 
probably that we have used all the selected sources in both sam- 
ples, while they have excluded the stars, which contribute more 
at higher fluxes, and hence would produce a steeper source count 
distribution, as observed. 

The AXIS-only ultra-hard source counts merge smoothly 
with the HBS, if we ignore our higher flux point (which could be 
affected by the exclusion of the bright targets in our survey). We 
have not used AXIS sources with fluxes below 10"''* cgs in the 
fit because the logA^-logS' steepens suddenly at those fluxes, per- 
haps because the Eddington bias is most important in this band 
where the number of sources is smallest, and/or because of the 
contribution from spurious sources. The best fit values including 
these lowest flux sources are very similar to the ones exclud- 
ing them, but the errors on the values are much larger. Since the 
ML fit is dominated by the bulk of the sources, rather than by 
a few discrepant points, and given the low number of sources, 
the AXIS-only and AXIS-hHBS source counts are very similar 
and mutually compatible. They are also compatible as well with 
the HBS-only value (r„ - 2.64+° 23), despite the fact that this 
logA^-log5 again only includes thie extragalactic sources. The 
reason why the discrepancy is almost unnoticeable in the ultra- 
hard band is probably because the stars contribute negligibly to 
the high Galactic latitude source counts in this energy band. 

We have compared our ultra-hard (4.5-7.5 keV) source 
counts to some previous results in the 5-10 keV band, assum- 
ing a photon index of 1.7 for the flux conversion between these 
bands. Our source count slope is slightly steeper than in Baldi et 
al. ( 120021 r„ = 2.54+;j ^nd flatter than in Loaring et al. ( 120051 
r„ = 2. 80^5 5^), but well within the l-cr limits in both cases. The 



source counts from Rosati et al. ( 120021 r„ - 2.35 ± 0.15) are 
much flatter, but still compatible with ours within less than 2- 
cr. They also find an indication of a break in the source counts 
at a 5-10 keV flux of 4 x 10"'^ cgs (or about 3 x 10"'^ cgs 
in the ultra-hard band). Since they also reach fainter fluxes 
(5 ultra-hard ~ 1 .6 X 10"'^ cgs), their flatter source count slope 
probably arises from a combination of an Euclidean slope at 
brighter fluxes and an even flatter slope at their faintest fluxes. 
Our absolute source counts agree well with both Loaring et al. 
(2005) and Rosati et al. (2002) at their brightest flux limits, and 
this agreement is maintained for our best fit logA^-log5 over the 
whole flux interval in the former case, while the latter is clearly 
flatter and below our best fit, indicating a flattening of the ultra- 
hard source counts below our flux limit as discussed above. The 
XMM-COSMOS ultra-hard number counts coincide with ours at 
the lowest fluxes (~ 10"'^ cgs), but they are above ours at higher 
fluxes, although within their relatively large error bars. 



5. Contribution to the X-ray background 

Using the best fit parameters from Table |5] we can estimate 
the intensity contributed by sources in different flux intervals 
(Eq.[T]l. In order to compare the total intensity contributed by re- 
solved sources with the total XRB intensity, we need to estimate 
the intensity from sources brighter than our survey limit (see 
Section lSTt . and to adopt a total XRB intensity (see Section l572] i. 

5. 1 . Intensity from bright sources and stars 

The contribution from bright sources is straightforward to ob- 
tain for the "traditional" soft and hard bands. For the soft band 
we have followed the same method as Moretti et al. (2003) sum- 
ming the fluxes of the sources in the Rosat Bright Survey (RBS, 
Schwope et al. 2000) with soft flux higher than 10"" cgs, and 
dividing by the area covered by the RBS. For the hard band we 
have used the sources in the HEAO-1 A2 survey (Piccinotti et al. 
1 1982 ). which is complete down to 3.1 x 10"" cgs. We have esti- 
mated the contribution of bright sources for the ultra-hard band 
from the hard band values, converting both the flux limit and 
the intensity to the ultra-hard band using a power-law with slope 
F = 1.7. The XID band strides the hard and soft bands and it 
is necessary to combine measurements taken in different bands 
and with diff'erent instruments. We have estimated the 2-4.5 keV 
flux limit and intensity from the hard band values using again 
r = 1.7, and added both of them to the soft band values. 

Since the estimates of the XRB intensity discussed below 
are for the extragalactic XRB, we need to estimate the contri- 
bution from Galactic stars down to our fainter flux limits, in or- 
der to subtract it from our estimated source intensities and get 
the resolved fraction of the extragalactic XRB. We take the es- 
timates of Bauer et al. (2004) from the fractions in their Table 
2 and their assumed total XRB intensities: 0.19+g g4 x 10"'^ and 
0.36;^Q jg X 10"'^ cgs deg"2 from stars in the soft and hard bands 
respectively. Since most of the contribution from stars comes 
from high fluxes (Bauer et al.'2004), and the stars at those fluxes 
have mainly low temperature thermal spectra (Delia Ceca et al. 
2004), we have estimated the stellar contribution in the XID 
band from the soft band contribution assuming a 0.5 MK mekal 
model under xspec, obtaining an XID-to-soft flux ratio of 1. 
Our estimate of the XID band intensity from Galactic stars is 
thus 0.19;^Q Q4 X 10"'^ cgs deg"2. This is a conservative estimate, 
since the source counts in the XID band are about one order 
of magnitude shallower than in the soft band. Using that same 
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Table 6. Intensity in different bands from different origins and flux intervals. The first column indicates the band, the second gives 
the origin of the intensity, the third and fourth columns list the flux interval, the fifth column total intensity from that interval, the 
sixth column shows the fraction of the XRB intensity contributed by sources in that interval. 



Band 


Origin 


^ min 


^ max 


mill S o S o max/ 


/xRB 






(cgs) 


(cgs) 


(10-" Cgs deg-^) 




soft 


Best fit AXIS+CDF logA'-logX 





3 X 10-" 


0.25 


0.03 


soft 


Best fit AXIS+CDF logA'-logS' 


3 X 10"" 


2 X 10-" 


5.60 


0.75 


soft 


Best fit AXIS+CDF logA'-fogi' 


3 X 10-'^ 


10-" 


6.54 


0.87 


soft 


RBS sources" 


10-" 




0.20 


0.03 


sort 


lotal resolved 


3 X 10"" 




0.55 


0.87 


soft 


XKB 






/.5 ± 0.4 




hard 


Best fit AXIS+CDF+AMSS logA'-logS 





3 X 10-"^ 


0.62 


0.03 


hard 


Best fit AXIS+CDF+AMSS logiV-log5 


3 X 10-'" 


1 X 10-" 


17.08 


0.85 


hard 


Best fit AXIS+CDF+AMSS logiV-log5 


3 X 10^'* 


3.1 X 10-" 


17.18 


0.85 


hard 


HbAU- 1 Az sources 


Q 1 1 n— I 1 
i.V X 10 




0.4j 


O.Oz 


hard 


Total resolved 


3 X 10 




1 -7 

1 /./5 


0.85 


hard 


XRB* 






20.2 ±1.1 




XID 


Best fit AXIS+BSS logA'-log5 


3 X 10"'^ 


10-" 


8.48 


0.56 


XID 


Best fit AXIS+BSS logA'-log^ 


3 X 10-'5 


2.38 X 10-" 


9.00 


0.59 


XID 


Bright sources'' 


2.38 X 10^" 




0.39 


0.02 


XID 


Total resolved 


3 X 10-'^ 




9.20" 


0.60 


XID 


XRB"* 






15.3 + 0.6 




ultra-hard 


Best fit AXIS+HBS logW-log^ 


9 X 10^'^ 


2 X 10-" 


1.50 


0.21 


ultra-hard 


Best fit AXIS+HBS logA'-logS' 


9 X 10-'= 


1.05 X 10-" 


1.74 


0.24 


ultra-hard 


Bright sources'' 


1.05 X 10-" 




0.15 


0.02 


ultra-hard 


Total resolved 


9x 10-'^ 




1.79" 


0.25 


ultra-hard 


XRB"* 






7.2 ±0.4 





" Schwope et al. l|2000} 

* Moretti et al. (2003 1 

" Piccinotti et al. (1982^ 

'' See text 

" After subtracting the stellar contribution (see Section lsTl l 



spectral model, the stellar contribution in the 1-2 keV band is 
(0.036 ± 0.010) X 10-'^ cgs deg-2. Given the even higher flux 
limit in the ultra-hard band, and the strong dependence of the 
thermal spectrum with energy, we have instead estimated the 
stellar contribution to the ultra-hard band intensity by summing 
the fluxes of the two sources identified as stars in the HBS, 
and dividing it by the sky coverage of that survey, obtaining 
(0.099 + 0.005) X 10-'^ cgs deg-^. No sources have been iden- 
tified as stars among the 60 identified XMS deeper ultra-hard 
survey sources (Barcons et al. I2007I I. and all of the 10 remain- 
ing unidentified objects in that sample are extended. This lower 
fraction of stellar identifications of X-ray sources as the flux de- 
creases and the band hardens, is consistent with the average soft 
thermal X-ray spectra of stars and their flat source counts (Bauer 
et al. l2004li . 



5.2. Total X-ray background intensity 

The total extragalactic XRB intensity measured with different 
instruments produces different results (Barcons et al. I2000I I. and 
not all the differences are attributable to cosmic variance. Moretti 
et al. (I2003I I have averaged several measurements available in the 
literature in the 1-2 keV and hard bands, obtaining 4.54 + 0.21 
and 20.2 ±1.1 (in units of 10-'^ cgs deg-^), respectively. We have 
used those values to estimate the XRB intensity in our bands 
assuming a power-law with F - 1 .4, which is an adequate model 
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Fig. 4. Relative contribution to the "total" intensity from equally 
spaced logarithmic flux intervals, using the best fit models in 
each band (see text): soft (solid line), hard (dashed), XID (dot- 
dashed) and ultra-hard (dotted). 

for the extragalactic XRB spectrum above 2 keV (Lumb et al. 
2002). The extrapolation of the value of Moretti et al. (2003ll 
in the hard band to the 1-2 keV band using that spectral shape 
produces a value similar to the one obtained directly by them 
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in that band, this justifies extrapolating again the same spectral 
shape down to 0.5 keV, although the shape of the XRB is poorly 
known below 1 keV. The results are given in Table |6] 

Our adopted hard band XRB intensity (from Moretti et al.) is 
in agreement with the estimate from Lumb et al. (2002, 2 1 .5 +2.6 
in the above units), and compatible with the result of De Luca & 
Molendi (i2004i 22.4+1 .6) within about 1 -cr. Exti-apolating those 
intensities to the soft band using F = 1.4 we obtain 7.46 ± 0.09 
and 7.78 ± 0.06, respectively, again fully compatible with our 
adopted value of 7.5 ± 0.4. 

Recently, Hickox & Markevitch ( l2006l henceforth HM06) 
have re-estimated the total XRB intensity in the 1-2 keV and 2- 
8 keV bands, using Chandra CDF data, and contributions from 
brighter sources. They have analysed thoroughly the non-cosmic 
background in Chandra, and isolated unresolved components of 
the XRB in those bands, which are only about ~20% and ~4% of 
the non-cosmic background contributions in those bands, respec- 
tively. The origin of this unresolved components is unknown. 
One possibility is stray light from sources outside the field of 
view. The HM06 total XRB intensity estimates in those bands 
are obtained by summing to these unresolved components the 
contributions from the resolved sources in the CDF data, and 
from brighter sources using the logA^-logS' of Vikhlinin et al. 
( I1995I I. with different spectral slopes at different fluxes. Their 1- 
2 keV intensity is 4.6 ± 0.3 (in the above units), very similar to 
our adopted value. With their estimates of the total XRB inten- 
sity, the total resolved fractions of the XRB are only 77 + 3% and 
80 ± 8% in the 1-2 keV and 2-8 keV bands, respectively, lower 
than previous estimates, in particular in the softer band. We will 
discuss the origin of these differences in Section l54l 

5.3. Contribution of different flux intervals to the X-ray 
background 

X-ray intensities are shown in Table|6]for the observed flux inter- 
vals in the samples used here, and for flux intervals with higher 
maximum fluxes, chosen to "join" the observed intervals with 
the contribution from bright sources, which are also given in that 
table, as well as the XRB intensities from Moretti et al. (2003). 

We have plotted in Fig. |4] the relative contribution of two 
flux intervals per decade to the total XRB intensity for the four 
default bands, assuming our best fit logA^-logS' . It is clear that 
the maximum contribution comes from fluxes around the break 
flux ~ 10 cgs. The contribution from the bins in one decade 
around that value are close to 50% of the total in the soft and 
hard bands. In the ultra-hard band we do not reach deep enough 
to detect the break. From the 5-10 keV band source counts of 
Hasinger et al. (2001) and Rosati et al. ( |20021 l. there could be a 
break just below ~ lO"''* cgs. 

The extrapolation the soft and hard logA^-log5 to zero flux 
using our best fit model (see Table|6]l does not saturate the XRB 
intensity (although in the soft band total XRB intensity is within 
the intensities spanned by the uncertainties in the best fit logA^- 
logS parameters). This suggests that the possibility of a new 
dominant population at lower fluxes is still open, mainly in the 
hard band. Under the assumption that the fraction of truly dif- 
fuse XRB is negligible (and the fluctuation analysis of Miyaji 
& Griffiths 120021 shows that the source counts continue growing 
in the soft band down to at least 7 x 10"'^ cgs), it is possible 
to estimate the minimum source counts slope necessary to just 
saturate the XRB at zero flux, if the source counts steepened just 
below the minimum flux studied here. Those slopes are 1.85 and 
1.84 in the soft and hard bands, respectively. Comparing these 
values to the observed slopes of the separate AGN and galaxy 



source counts in the CDF (Bauer et al. |2004| l, only the absorbed 
AGN (slope ~ 1.62, versus 1.2-1.5 for the rest of the AGN es- 
timates) come close to be able to saturate the soft XRB, while 
galaxies can do it with just about any estimate for their source 
counts slope (2. 1-2.7), if it keeps growing at the same rate below 
the resolved fluxes. The situation in the hard band is again the 
same, with the absorbed AGN (source counts slope 1.95, versus 
1.4-1.5) being the only AGN population able to contribute the 
rest of the unresolved intensity, while the galaxy source counts 
estimates (slope 3-3.5) could easily fulfil this role. A similar con- 
clusion is reached if a higher intensity for the XRB is adopted 
(HM06), since this would steepen the faint source counts slope 
required to saturate the XRB. 

Even if the source counts re-steepened as discussed in the 
previous paragraph, it is clear that the maximum contribution to 
the XRB in the soft and hard bands (and probably also in the 
XID band, and perhaps also in the ultra-hard band) comes from 
sources with fluxes within a decade ~ 10"'"* cgs, where most of 
the AXIS sources in those bands lie. This is also true if the XRB 
intensity is higher than the value used here. Medium depth sur- 
veys with limiting fluxes close to that value are therefore crucial 
to understand the evolution of X-rays in the Universe, at least 
in the (relatively) soft bands considered here. Sources at lower 
fluxes and/or heavily obscured are of course much more impor- 
tant for the overall energy content of the XRB (Gilli et al. 120011 
Fabian & Iwasawa 1999), since most of it resides in harder X- 
rays, where the resolved fraction is much smaller (Worsley et al. 
50041) . 

5.4. Resolved and unresolved components of the X-ray 
background 

Comparison between HM06 results and ours (and previous) re- 
sults involve an uncertainty concerning the different bands under 
consideration (0.5-2 keV vs. 1-2 keV, and 2-8 keV vs. 2-10 keV). 
We have assumed power-law photon indices of 1.5 for the unre- 
solved component (as fitted to the unresolved XRB spectrum by 
HM06), 1.43 for the resolved faint sources (again as fitted by 
HM06 to the summed spectrum of their resolved sources), and 2 
for the resolved sources brighter than ~ 10"'** cgs (Mateos et al. 
! 2005l ), for the conversions from the 1 -2 ke V to band the soft band 
and vice versa, as well as the conversions from the 2-8 keV band 
to the 2-10 keV band. HM06 have excluded from their study of 
the unresolved XRB areas around the sources in Alexander et al. 
( 120031 ), with 0.5-2 keV and 2-8 keV flux limits of 2.5 x 10 '^ and 
1.4 X 10 cgs, respectively. Hence, HM06 define as unresolved 
intensity that coming from sources below those flux limits (or 
from a truly diffuse component). 

An additional correction comes from the 2-8 keV band flux 
limits for the resolved sources of HM06, which they take from 
Alexander et al. (|2003) . This same sample was also the basis 
of Bauer et al. ( |2004| l. We have already seen (Section [4. 2t that 
those fluxes need to be decreased by 12% due to a change in 
the Chandra calibration, in addition to the correction due to the 
different bands, just discussed. Assuming an spectral slope of 
1 .43 and taking into account this flux correction, the flux limit in 
the 2-10 keV band becomes 1.6 x 10"'^ cgs. 

We compai-e in Table|2]the 0.5-2 keV, 1-2 keV and 2-10 keV 
X-ray intensity (from different origins and flux intervals) from 
HM06 and from the results from this and previous works. The 
error bars on our estimates of the intensities have been estimated 
from the errors on the logA^-logS' best fit parameters using the 
standard error propagation rules (Wall & Jenkins 120031 . This 
has not been possible for the extrapolation to zero flux, since the 
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Table 7. Comparison between the estimated X-ray intensities 
from HM06 and this and previous works, originating from dif- 
ferent origins and flux intervals. The table is divided in three 
sections: the top one is for the soft band, the middle one for the 
1-2 keV band and the bottom one for the hard band. In the mid- 
dle section the conversion between 0.5-2 keV and 1-2 keV have 
been done in each flux interval using the photon indices given 
at the beginning of Section 15.41 The first two columns indicate 
the flux limits between which has been estimated the intensity, if 
the first one is missing the intensity is calculated from zero flux, 
while if the second one is missing, the intensity is calculated to 
infinity. The third column is the intensity from HM06, and the 
fourth column the intensity from this work (or previous ones as 
indicated). The last three rows in each table section give the to- 
tal intensity (as estimated directly by the sum of the values in the 
third column by HM06, and as estimated from previous works 
in the fourth column), the total intensity resolved into sources 
(with the stellar contribution subtracted, see Section ISTt . and 
the fraction of the total intensity resolved into sources. 
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expressions involve the natural logarithm of the lower limit of 
the interval. In this case we have used the values spanned by the 
uncertainties in the best fit logA^-logS' parameters, as explained 
in Section|43] 

The most noticeable diff'erence is between the total soft XRB 
intensities, which are not compatible at the ~l-cr level, while 
the corresponding 1-2 keV intensities are compatible at 0.16-cr 
(see Section lSTSI ). This is because of the very different ways they 
have been obtained: our adopted value is from an extrapolation 
of the Moretti et al. (|2003) 1-2 keV total XRB intensity assum- 
ing r = 1.4, while the HM06 estimate uses different contribu- 
tions with different values of the photon index. Since the contri- 
bution from the brightest sources is almost half of the total, and 
they have the steepest spectra, the "eff'ective" spectral slope in 
the conversion of the HM06 XRB intensity from 1-2 keV to 0.5- 
2 ke V is r ~ 1 .72, much steeper than our assumed F = 1 .4, and 
hence with a much larger contribution from the 0.5-1 keV inter- 
val. The HM06 resolved contribution also increases in the soft 
band with respect to the 1-2 keV band, but the resolved frac- 
tion increases only slightly, because of the very similar effective 
spectral shapes of the resolved component and the XRB inten- 
sity. In contrast, our estimated resolved component is very sim- 
ilar to HM06, but our assumed XRB intensity in the 0.5-2 keV 
has a much flatter spectral shape, resulting in a smaller resolved 
fraction, but still compatible within the errors. 

We have also converted our different resolved contributions 
from the soft band to 1-2 keV using different spectral slopes 
for different flux intervals (as indicated at the beginning of this 
Section). The differences in the resolved components and in 
the resolved fraction with respect to HM06 are very small (see 
Table|7J and well within the mutual uncertainties. 

In summary, in the soft band the difference in the total back- 
ground intensity is just above 1-cr, and it mostly arises because 
the different effective spectral shape assumed for the total XRB 
intensity between HM06 and most previous works. In contrast, 
the resolved intensities are very similar in HM06 and in our 
work. 

A further source of uncertainty in the comparison of both 
XRB intensities is our extrapolation of the F = 1 .4 XRB spectral 
shape below 1 keV, where its real shape is not known. Assuming 
that the XRB spectrum steepens just below 1 ke V with a power- 
law shape, we have estimated that F - 2.2 would be needed 
to produce our estimate of the HM06 soft XRB intensity from 
their 1-2 keV XRB intensity, which is difficult to accommodate, 
given the average slope of the faintest sources detected and of the 
unresolved component in the soft band (F ~ 1.4 - 1.5, HM06). 
Therefore, the uncertainty in the <1 keV XRB spectral slope 
cannot fully account for the difference in the soft XRB intensities 
discussed above. 

In the hard band there is a small, statistically not signifi- 
cant, discrepancy in the total background intensity, and we agree 
quite well also in the resolved component, with us estimating a 
slightly higher value, but well within 1-cr. Our estimate of the 
bright source contribution in the hard band is fairly robust, since 
the AMSS sources cover that part of the logA^-logS' . 

Extrapolating the logA^-logS' to zero flux, we cannot saturate 
the unresolved component, so either there is a diffuse compo- 
nent, or the logA^-log5 has to steepen again somewhere below 
the current flux limits (see Table s|6] and |7] and HM06). 

6. Looking for source clustering 

A survey to detect source clustering requires both depth (to 
achieve high angular density), and width (to minimise the possi- 
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Table 8. Information on the simulations for the angular correlation function, together with fit results: Band is the band where the 
sources were selected, A^pooi is the number of sources in the pool used for the bootstrap simulation (see text), is number of sources 
selected in the real sample, A^sim is the number of simulations, is the value of the fit when using a "null" model, is the best 
fit value for a power-law model (see Eq. |6]l, with the best fit parameters and their 1-cr uncertainty intervals given the the last two 
columns, A^bin is the number of bins used in the fit, and P{F) is the F-test probability that the improvement in the fit is significant (the 
smaller the P(F) the higher the significance). The second row in each band corresponds to the best fit fixing y to the "canonical" 
value 0.8. 
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bility that a single structure biases the overall average). In prac- 
tice, a compromise between those two conflicting requirements 
has to be achieved. The AXIS sample comprises 36 fields outside 
the Galactic plane, and a relatively high source density (about 
35 sources per field, at least in the soft and XID samples), and 
we have tested what can it say about the angular distribution of 
sources in the sky. 

6.1. Cosmic variance 

The simplest test for source clustering is to compare the actual 
number of sources detected in each field A^;; with the number ex- 
pected Ak from the best fit logA^-logS' and the sky area of each in- 
dividual field ilk (this is similar to the traditional counts-in-cells 
method). In principle, if just one field (or a few fields) happens 
to look through strong cosmic structures, its number of sources 
should be significantly different from the expected number from 
a random uniform distribution, as measured by the overall logA^- 
log5. 

The statistics we have used to measure the deviation from 
such a random uniform distribution are the cumulative Poisson 
distributions: 

PaS> Nk) = ZT=m P^^^^ if > 

where Pa{1) is the Poisson probability of detecting I sources 
when the expected number of sources is A. The cumulative dis- 
tributions above give the probability of finding > Nk (first row) 
or < Nk (second row) sources when the expected number from 
the source counts is Ak- This method is similar to the one used in 
Carrera et al. ( 1 19981 1, but in that work we used Pa{1) instead of 
the cumulative probabilities. We believe that the approach used 
here is a more conservative estimate of how likely is to find a 
number of sources in a field which is different from the expected 
value. 



The maximum likelihood statistics for the whole sample is 
then 

L' = ^ Pa,{> Nk) + Yj PaA< Nk') 

k k' 

, where k runs over the fields for which A^^ > Ak, and k' over the 
fields for which Nt < Ak' . 

We have compared the observed L' values in each band with 
10000 simulated values, using the values of Ak and Poisson 
statistics. The number of simulations with likelihood values 
above the observed ones were 1388, 4580, 778 and 4958 for the 
soft, hard XID and ultra-hard bands, respectively. Nothing sig- 
nificant is found in the hard or ultra-hard samples, while some 
deviation below or about at the 90% significance level is found 
in the soft and XID bands. These results, although formally at 
a low significance, have encouraged us to try more elaborated 
tests for clustering. 

6.2. Angular correlation function 

If cosmic structure is present in all (or most) fields, a test for 
significant deviations from the mean number of sources in each 
field from some overall average will not give significant results. 
We should look instead for evidence of sources tending to appear 
together in the sky with respect to an unclustered source distri- 
bution. The classic parametrisation for this effect is the angular 
correlation function W{9) which measures the excess probability 
of finding two sources in the sky at an angular distance 6 with 
respect to a random uniform distribution (Peebles [l980l) : 

SP = r?6ni6Q.2[l + W(0)] (4) 

where 6P is the probability of finding two objects in two small 
angular regions 6Q.1 and 6Q.2, separated by an angle 6, when the 
sky density of objects is n. 

Since the angular separation is a projection in the sky of 
the real spatial separations of the sources at different redshifts. 
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Fig. 5. W{6) vs. 6 for the soft (top-left), hard (top-right), XID (middle-left), and ultra-hard (middle-right) samples, for a hardness- 
ratio selected sample (bottom-left) and for a random selection of one third of the soft sources (bottom-right). The solid dots are the 
integral-constraint-corrected observed values, the grey triangles show the "zero points" from the integral constraint (displaced to 
the left for clarity), and the solid lines are the best fit;^'^ fits to Eq.|6] The middle-right and two bottom panels have different scales 
on their Y-axis. 



the underlying spatial clustering is somewhat blurred with this 
purely angular measurement. Unfortunately, the more powerful 
spatial clustering depends on having redshifts for a very high 
fraction of the sources, or at least knowing precisely what is 
their redshift distribution. Since none of these two conditions 



are fulfilled by any of the AXIS samples, we have used the data 
presently at hand to study the angular correlation function. 

There are several proposed ways of measuring the angular 
correlation function, most of which look for an excess number 
of source pairs at a given angular separation with respect to 
a simulated random "uniform" sample (Landy & Szalay 119931 
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Efstathiou et al. |1991| l. We have chosen the one proposed by 
Efstathiou et al. jlWH . 



DD 

w{e) = / 1 

■' DR 



(5) 



these two fields from all subsequent angular clustering analysis, 
leaving 34 fields. 

The W{9) obtained in this fashion are shown in Fig.|5] as well 
as the coiTespondingbest;^^^ fits to a power-law model 



where DD is the number of actual pairs of sources with angular 
separation 9, and DR is the number of pairs of one real and one 
randomly placed simulated source (see below) at the same sep- 
aration, while / is a normalisation constant to take into account 
the different number of real and simulated sources. The error 
bars around each point are given by AW(0) - V(l + W{0))/DR 
(Peebles [T980] l. 

The "randomly placed" simulated sources have to follow 
as closely as possible the real distribution of the source detec- 
tion sensitivity of the survey, giving a "flat" random sky against 
which to judge the presence (or otherwise) of significant over- 
densities at different angular separations. We have used boot- 
strap simulations by forming a pool of real sources with detec- 
tion likelihoods higher than 15 (our standard value) in the band 
under study, irrespectively of whether their count rates were 
above or below the sensitivity map of the corresponding field at 
the source positions. Then, keeping the number of sources sim- 
ulated in each field equal to the real number of sources N^, we 
have extracted sources from this pool, keeping their count rates, 
and their distances to the optical axis of the X-ray telescope, but 
randomising their azimuthal angle around it. If the source had 
a count rate above the sensitivity map of the field under con- 
sideration at its "new" (X, Y) position, the source was kept in 
the simulated sample, otherwise a new one is extracted until A^^ 
valid simulated sources are found. In this way, the angular distri- 
bution of the simulated sources mimics the decline of the source 
detection sensitivity with off-axis angle. The number of random 
samples A^sim used in each band are shown in Table[8] They have 
been chosen so as to give a total of about a million simulated 
sources in each band. 

With this recipe, the normalisation constant / above is given 

by 

Nk being the number of sources in field k and A^sim the number 
of simulations. 

If a positive correlation is present at angular scales compa- 
rable to the individual field size, the estimate of the mean sur- 
face density of objects from the survey is too high, and this 
causes a negative bias in the angular correlation function known 
as the integral constraint (Basilakos et al. 2004). Given the com- 
plicated dependence of the sensitivity over the area of our sur- 
vey, we have corrected for this effect empirically, finding the an- 
gular correlation function that we would have detected in the 
absence of correlation, via the average of A^sim simulated re- 
alisations of W{0), where the real data were replaced by ran- 
dom samples, simulated independently following the recipe in 
the previous paragraph. The triangles in Fig.|5]show these "zero 
points" at each angular scale, which have been used to increase 
the corresponding observed W{0), convolving the error bars us- 
ing Gaussian statistics. 

Applying Eq. |5]to the full sample results in a ~3-cr signif- 
icant bump at about 200 arcsec, which was also present in the 
simulations, but at lower significance. The bump turned out to be 
present only in two fields (HD 111 8 12 and HDl 17555), a search 
in the literature revealed that both fields have stellar clusters in 
their field of view (Eggen & Iben |1989l l. Since we are interested 
mainly in extragalactic structure, we have opted for excluding 



(6) 



The best fit parameters for this model are given in Table [8] 
In that Table we also give the significance of the detection from 
an F-test comparing the value of the power-law model with a 
simple W{6) - no-clustering model. The F-test suggests sig- 
nificant coiTelation in the soft and XID bands at the ~99% level. 
The lack of significant detections in the two harder bands might 
be due to the lower number of sources with respect to the soft 
and XID bands (see below). 

Gandhi et al. (2006) also found significant angular correla- 
tion in the XMM-LSS sample in the soft band, with a similar 
slope {y - 1 .2 ± 0.2), and a lower coiTelation length (^o - 1 + 
3 arcsec), but still compatible with ours within ~l-cr. Vikhlinin 
& Forman d 19951 1 foundy = 0.7±0.3 and 6*0 = 4+3 arcsec, again 
a lower correlation length than us, but compatible within less 
than 1-cr with our y = 0.8 result. On the other hand, Basilakos et 
al. (2005) also found significant angular correlation in this band, 
but with a higher correlation length {9q - 10.4 + 1 .9 arcsec, with 
the canonical slope), just compatible with our results at about 
the 2-0" level. Basilakos et al. (12005b also used Limber's equa- 
tion (Peebles 1980) to calculate the spatial correlation function 
from the angular correlation function, assuming several differ- 
ent AGN X-ray luminosity functions (since most X-ray sources 
at low flux levels are expected to be AGN). Their high coiTela- 
tion lengths are only compatible with AGN optical (e.g. Croom 
et al. 120021 Grazian et al. .2004J or X-ray correlation functions 
(Carrera et al. [T998l Akylas et al. |2000l MulUs et al. 12004) if the 
clustering is constant in physical coordinates, while optical QSO 
clustering seems instead to be constant in comoving coordinates 
(Croom et al. 2001j but see also Grazian et al. 2004 Croom et 
al. l2005l l. Akylas et al. (120001) also found significant angular cor- 
relation in the soft band sources in the RASS-BSC, but at much 
higher angular distances (~ 8°). 

Correlation of hard band selected sources is not detected by 
Gandhi et al. (2006 ) (413 sources) and Puccetti et al. (2006 ) (205 
sources). However, clustering is very significantly detected (> 
4o-) by Basilakos et al. (I2004I I (171 sources) with y - 1.2 ± 
0.3 and 9q - 49^25 arcsec. Clustering is also detected at similar 
significance by Yang et al. (2003 ), with 0o = 40 ± 1 1 arcsec (for 
y = 0.8) for 278 sources. This is somewhat surprising since the 
Basilakos et al. sample has less than half the number of sources 
than Gandhi et al., and a similar number of sources to Puccetti et 
al., and the Yang et al. sample is again smaller than the Gandhi et 
al. sample. Our angular correlation length is compatible with the 
Basilakos et al. (|2004| l (and with the Puccetti et al.) result within 
1-cr, but not with the Yang et al. correlation length. However, 
the F-test on the correlation function, and the superior PAiN) 
test (see below and Fig. |6]l do not detect clustering. Again, our 
sample seems to give somewhat intermediate results in between 
those of Basilakos et al. ( |2004| l and Yang et al. ( 120031 ) on one 
hand, and Gandhi et al. (120061 ) and Puccetti et al. (120061 ) in the 
other. 

Since each bin in W(6) contains information from many 
pairs, perhaps involving the same real and simulated sources 
many times, it is at least debatable whether the hypothesis of 
independent bins underlying the test applies in this case. To 
circumvent this problem, we have applied a simple test for the 
significance of the correlation, using Poisson statistics to com- 
pare the expected number of pairs (at distances smaller or equal 
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Fig. 6. 1 - PfjiNe) vs. for the soft (top-left), hard (top-right), XID (middle-left), ultra-hard (middle-right) selected samples, for a 
hardness-ratio selected sample (bottom-left) and for a random selection of one third of the soft sources (bottom-right). The median 
(50%), 99% and 99.9% levels from random simulations are also shown (grey jagged lines). The Y-axis scaling is different in different 
panels, and has been chosen for maximum clarity. 



than each individual observed distance) in absence of correla- 
tion (fj. - YjO'kb DRiO')/ f) with the number of observed pairs 
(Ng - 2e'<e DD{9')). The quantity 1 - PftiNg) versus the angu- 
lar distance is shown in Fig.|6]for each band. We can estimate 
the "signal" in the absence of correlation (similarly to what we 
have done to quantify the integral constraint) repeating this exer- 
cise, but replacing the real data by randomly placed sources (see 



above). For each sample we have performed 10000 simulations 
along these lines. In this way, we have for each angular scale and 
sample the distribution of the expected value of 1 - P^{Ng) if the 
sources are not correlated. We show in Fig. |6] the median and 
the upper 99% and 99.9% percentiles of those distributions for 
each angular scale. The simulations show clearly that the detec- 
tion significance is actually lower than the value given purely by 
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Fig. 7. 1 - PfiiNe) vs. 6 for the soft band and for the fields in the northern (left) and southern (right) Galactic hemispheres. The 
median (50%), 99% and 99.9% levels from random simulations are also shown (grey jagged lines). 



Poisson statistics, when there are more than ~1000 sources, as 
in the soft and XID bands, where the 99% percentile is around 
the Poisson 3-cr level. While in the hard and ultra-hard bands 
the 99% percentile is around the "right" Poisson position, albeit 
with large excursions. 

Using the percentiles from the simulations, we have found 
a > 99% (but mostly <99.9%) significant detection of correla- 
tion in the angular positions of in our soft sample at separations 
200-700 arcsec. In the XID band the significance is lower, at just 
about 99% significance, with essentially one peak of higher sig- 
nificance at ~300 arcsec. In contrast, no signal is found in either 
the hard or ultra-hard bands. The lack of significant detections 
in the two harder bands might be due to the lower number of 
sources with respect to the soft and XID bands. 

We have performed a test for the influence of the number 
of sources in the detection of clustering, since we only detect 
clustering in the samples with more than 1000 sources. We have 
rejected at random two of every three sources in the soft band 
sample, and repeated both tests for correlation: the clustering 
signal disappears (see Table [8] Figs. |5] and |6l). We have also re- 
peated this test keeping at random one half and two thirds of 
the soft sources, only finding coiTelation at the 99% level in the 
last case. We conclude that clustering in the hard and ultra-hard 
bands may be as strong or stronger than clustering in the soft 
band, but our relatively low number of sources prevents us from 
detecting it. This is compatible with the results of Gilli et al. 
( 1200 5). which do not find any significant difference between the 
clustering properties of soft or hard X-ray selected sources in the 
CDF. 

The significant clustering found with the PfiiNe) of the soft 
band coiTesponds to a broad bump between about 150 and 
700 arcsec, with "peaks" at about 300, 400 and 650 arcsec (the 
first two are significant even at the 99.9% level). These "peaks" 
also appear to be present in the XID band. While the increase in 
the number of real pairs giving rise to the broad bump is very 
gradual, the "peaks" must correspond to significant increases in 
relatively naiTow ranges of angular distances. To investigate if 
these "peaks" come from a small number of fields in a partic- 
ular region of the sky, we have divided the sample between the 
fields in the northern and southern Galactic hemispheres. The 
1 - P^{Ng) for these regions are shown in Fig. Q Despite the 
lower significances (expected because of the lower numbers of 
sources), the broad bump and the peaks are still present in the 



samples from the two hemispheres, the ~300 arcsec peak being 
common to both, with the one at 400 arcsec appearing predom- 
inantly in the northern hemisphere, and the one at 650 arcsec 
in the southern hemisphere. We have also repeated the analysis 
excluding the two fields with obvious cluster emission (A1837 
and A399) from the northern and southern Galactic hemisphere 
sub-samples, respectively, finding no significant differences in 
the strength or position of the "peaks". As a further test, we 
have looked for signal in the 7 deeper fields (Texp>40 ks, ~400 
sources) and in the remaining 27 fields (~800 sources), finding 
it only in the latter, and hence the signal is not due to a few deep 
fields. 

A detailed discussion of the (cosmic or statistical) origin of 
these structures is outside the scope of this paper, and will be 
discussed with a larger sample elsewhere. However, whatever 
the origin of the angular structure we have found, it must be 
relatively widespread over the data and/or the sky. We take this 
to mean that, if it has a real cosmic origin, it must come from 
sources at z < 1.5, which is the peak of the redshift distribution 
of medium depth X-ray surveys (Barcons et al.|2007jl. 



6.3. Angular correlation of a hardness-ratio-selected sample 
of sources 

A low significance or absence of correlation in the hard band is 
somewhat surprising, since most sources at the flux levels of our 
sample are AGN (Barcons et al. 2007), which are known to clus- 
ter strongly (both X-ray selected -Mullis et al. 2004, Yang et al. 
2006 - and optically selected -Croom et al. |2005l -). Furthermore, 
obscured AGN appear more commonly in hard X-ray selected 
samples (Delia Ceca et al. [2006 Caccianiga et al. 2004 Delia 
Ceca et al. l2004l Barcons et al.|2007jl. 

Gandhi et al. (2006^ have found that, despite the absence of 
significant angular correlation in their hard sample, if they select 
only the hardest spectrum sources (using the soft-to-hard hard- 
ness ratio), and they use lower significance sources, the signifi- 
cance of the correlation increases to ~ 2-3cr. They lowered their 
significance level in order to increase the number of sources in 
the tested sample. This significant correlation only in the hard- 
est of the hard-X-ray-selected sources is a very intriguing and 
potentially interesting result. 
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We have done a similar analysis, defining the same hardness 
ratio///? = (H-S)/{H+S)wheie Hands are the 2-12 keV and 
0.5-2 keV count rates, respectively. We have selected the sources 
detected in either the soft or hard bands (in the sense explained 
in SectionlMll which have HR > -0.2 (Gandhi et al. ,2006i) . 

The F-test seems to indicate correlation at > 99% level 
(TablelSj, but the superior Poisson method (Fig.jS) does not de- 
tect clustering on any angular scales, with the data being actu- 
ally consistent with the median of the random simulations. We 
cannot thus confirm the results of Gandhi et al. (i2006j , despite 
having a slightly larger sample of sources (250 versus their 209). 
This is compatible with the results of Yang et al. (2006 ) which do 
not find any significant differences between the spatial clustering 
of sources with soft or hard spectra. 

7. Conclusions 

We present the results from AXIS (An XMM-Newton 
International Survey) which comprises 1433 distinct serendip- 
itous X-ray sources detected with a likelihood of 15 (~5-cr) 
in XMM-Newton EPIC pn observations in 36 different XMM- 
Newton observations at high Galactic latitude. We have defined 
sub-samples in four bands: 0.5-2 keV(soft, 1267 sources), 2- 
10 keV (hard, 397 sources), 0.5-4.5 keV (XID, 1359 sources) 
and 4.5-7.5 keV (ultra-hard, 91 sources). The first two being the 
"standard" X-ray bands, the third one chosen to span the energy 
range with the best sensitivity of that camera, and the last one 
taking advantage of the unprecedented hard X-ray sensitivity of 
XMM-Newton above 4 keV. Given the distribution of exposure 
times of the fields selected for AXIS, it will serve as a pathfinder 
to the X-ray and optical properties of the sources in large scale 
XMM-Newton catalogues (IXMM, 2XMM, . . .). 

Using count rates in the 5 standard XMM-Newton bands as 
low-resolution spectra we have fitted single power-laws to all 
sources in the XID and hard bands independently, in order to 
calculate source fluxes without assuming a constant spectrum 
(see Table|4]i. Our internal and external checks make us confident 
on the results from this technique. The average best fit power-law 
slope in the XID band is (F) ~ 1.8, independently of the sample 
to which the sources belong, while the average is (F) ~ 1 .5 - 1 .6 
in the hard band. 

We have constructed empirical sensitivity maps in the four 
bands, taking into account the exposure maps, the background 
maps and the excluded regions (see Table [TJ. We have derived 
sky coverages as a function of flux in the four bands, taking into 
account the distribution of spectral slopes of the sources. The 
total area covered is 4.8 deg^ (see FiglT)- 

Our data have been combined with both wider and shal- 
lo wer (B SS/HBSS, Defla Ceca et al. |2004l AMSS, U eda et 
al. I2005I I. and narrower and deeper (CDF, Bauer et al. |2004| l 
surveys to measure the X-ray source counts over as wide flux 
range as possible, reaching 4 orders of magnitude in the soft 
band (-2000 sources), ~3 in the hard and XID bands (-800 
and ~1800 sources, respectively), and a bit more than 1 in the 
(largely unexplored) ultra-hard band (~140 sources). Data from 
different observatories and instruments are mutually compatible, 
allowing the joint analysis of the source counts. 

We have performed maximum likelihood fits to a broken 
power-law model for the logA^-logS' in the four bands (see 
Table |5]). We have found that fits to the source counts using fixed 
spectral slopes produce similar results to those using individual 
source spectral slopes (as we have done here), but give rise to 
larger error bars. The source counts in the soft, hard and XID 
bands show breaks at fluxes ~ 10"'"* cgs. Detailed examination 



of the ratios between the data and that simple model in the soft 
and XID band do not show any significant differences, while in 
the hard band there seems to be evidence for a further flux break 
at ~ 3 X 10"'^ cgs and several changes of slope. Since this is only 
present in the hard band, it is difficult to assess whether this be- 
haviour is due to a too simple characterisation to contributions 
from different populations at different redshifts, or it is due to 
calibration uncertainties. Future large scale XMM-Newton cata- 
logues (such as 2XMM) will be useful to address this question. 
The best fit model parameter values are compatible with pre- 
vious smaller or similar surveys, but our combination of large 
number of sources and wide flux coverage produce in general 
smaller uncertainties in the best fit parameters. 

We have used our best fit logA^-logS' parameters to calculate 
both the total resolved fraction of the XRB (including the contri- 
bution from sources a bright fluxes, see Table|6l), and the relative 
contribution of different flux bins to the XRB (see Fig. |4]i. We 
have used the estimates of the average XRB intensity in the soft 
and hard bands from Moretti et al. ( I2003I I. translating them to the 
XID and ultra-hard bands using F = 1.4 (Marshafl et al. 1 19801 1. 
The total resolved fraction down to the lowest fluxes of the com- 
bined sample reaches 87% in the soft band and 85% in the hard 
band, where we have been able to reach deep fluxes using pen- 
cil beam surveys (Bauer et al. I2004I I. but it is only ~60% in the 
XID band and about 25% in the ultra-hard band. The total inten- 
sity produced by extrapolating our best fit soft and hard logA^- 
log5 to zero flux is insufficient to saturate the XRB intensity. 
Assuming a second flux break just below our minimum detected 
fluxes, we have estimated the minimum slope below this break 
necessary to produce the whole XRB with discrete sources, get- 
ting 1.85 (1.84) in the soft (hard) band. Comparing these slopes 
with those of the fainter galaxy and AGN source counts from 
the CDF (Bauer et al. 2004), reveal that galaxies could easily 
provide this re-steepening, while among AGN only perhaps the 
absorbed ones could just about do it. 

The maximum fractional contribution to the XRB in the soft, 
hard and XID bands comes from sources within a decade of 
10 ''^ cgs (which is about where the break flux for the broken 
power-law lies). This fractional contribution reaches about 50% 
of the total in the soft and hard bands. Medium depth surveys 
such as AXIS (and indeed the IXMM catalogue, and its suc- 
cessor 2XMM) therefore are instrumental in understanding the 
evolution of the X-ray emission in the Universe, at least up to 
lOkeV. 

Hickox & Markevitch (I2006I I have recently re-estimated the 
soft and hard XRB intensities using CDF data, finding lower re- 
solved fractions of the XRB than our (and previous) estimates. 
We have found that the difference with our results in the soft 
band (where it is highest) is only at the ~l-cr level, and it is 
mainly due to the different ways in which the total XRB inten- 
sity is calculated. HM06 add different contributions at different 
fluxes, which produces an "effective" spectral shape of the XRB 
(F ~ 1.8) which is much steeper than the "canonical" XRB spec- 
tral slope of F = 1.4 (which we have assumed). We have shown 
that the difference between both estimates of the soft XRB in- 
tensity cannot be removed just by relaxing our assumption. The 
HM06 intensity from resolved sources is very similar to ours. 
Converting our results to 1 -2 ke V, the difference with HM06 all 
but vanishes. 

After excluding two fields in which there is evidence for the 
presence of stellar clusters, we have used the AXIS sources to 
study the presence of cosmological structure in the X-ray sky, 
through the cosmic variance in the number of sources per field, 
and the distribution of the angular separations of the sources. 
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The first one is in principle more sensitive to the presence of sig- 
nificant over-densities in a few fields, while the second looks for 
an overall angular clustering of sources. No cosmic variance is 
detected at all in the hard and ultra-hard bands, while some sig- 
nal at about the 90% level is present in the soft and XID bands, 
probably because they are the ones with the larger number of 
sources. Angular clustering is studied in two ways. The first one 
is the angular correlation function W(ff). Using this method, we 
detect signal at about the 99% level in the soft and XID bands, 
but not in the hard and ultra-hard bands (Table |8] and Fig. |5]|. 
The strength of the clustering signal we have found is interme- 
diate between those of previous results (e.g. Gandhi et al. 120061 
Basilakos et al. ,2004. 2005, VikhHnin & Forman [19951). 

A formally more appropriate method using Poisson statistics 
detects clustering at the 99-99.9% level in the soft and XID sam- 
ples, but not in the hard or ultra-hard bands (Fig.|6]l. Repeating 
the test for a randomly l-in-3 selected sample of soft sources 
completely destroys the clustering signal, which we take to im- 
ply that there might be a "real" angular clustering among hard 
sources which we have failed to detect due to the small number 
of sources. Dividing the soft sample in several sub-samples re- 
veals that the signal is widespread over the sky, and not limited 
to a few deep fields. This means that, if it has a cosmic origin, 
it must come from z < 1.5, the peak of the redshift distribution 
of medium flux X-ray surveys (Barcons et al. 2007). We can- 
not confirm the detection of signal among hard-spectrum hard 
sources reported by Gandhi et al. ( I2006I I. 
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Appendix A: Empirical sensitivity maps 

The present version of esensmap (the SAS task to calculate sen- 
sitivity maps for the EPIC cameras on board XMM-Newton) as- 
sumes pure Poisson statistics to evaluate the detection sensitivity 
at different positions in the field of view. Since the detection of 
the sources and the determination of their parameters is a com- 
plicated process, which includes how well does the profile of the 
count rates match the PSF, the result is bound to deviate from 
the naive assumption of pure Poisson statistics. 

We have followed an empirical approach, looking for a sim- 
ple relationship between the observed EPIC pn count rates of the 
detected sources in each band cr (in cts/s, columns RATE in the 
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SAS source lists), and the pure Poisson count rate crpoisim (in 
cts/s) as calculated from the total number of background counts 
bgdim within a circle of radius cutrad (related to the column la- 
belled CUTRAD in the SAS source lists, and given in units of 4" 
pixels) around the source position, the detection likelihood of the 
source L (columns labelled DET_ML in the SAS source lists), and 
the average value of the exposure map (in seconds) within that 
radius expim, so that 

- log{Phgdimi> {bgdim + crpoisim X expim)) — L (A.l) 

with Pa(> N) as defined in Eq.[3] 

The values of CUTRAD are in principle different for each 
source in each field, since they are chosen by emldetect to 
maximise the signal-to-noise ratio for source detection. We have 
found that there is no significant correlation between CUTRAD 
and either the off-axis angle of the source, RATE or BGJIAP (the 
average background value in counts per pixel within the extrac- 
tion circle), and therefore we have adopted for cutrad in each 
band the average of CUTRAD for the sources having L between 8 
and 20 in that band. We have checked that there is no significant 
difference in the goodness of the fits below between using the 
actual CUTRAD value for each source or the average cutrad. 

expim above is the average of the exposure map within a 
circle of radius cutrad around the X-ray source positions. The 
exposure maps for the single bands are PPS products. We have 
created exposure maps for the composite bands using the SAS 
task eexpmap, which generates exposure maps for the energy 
in the middle of the PI interval, which could be inaccurate for 
wide bands. To assess the effect of this approximation we have 
also calculated "average" exposure maps in the composite bands, 
weighting the single band exposure maps with the counts in the 
images pixel by pixel. Again, there are no significant differences 
between the results from those two types of exposure maps. 

Unfortunately, the background files are not PPS products. 
We have generated them from the individual and composite 
bands using the SAS task esplinemap, which excludes areas 
around the sources in an input source list, and fits the remaining 
background using a spline (with 16 nodes in our case). We have 
then calculated bgdim by summing the values of the background 
image over a circle of radius cutrad. 

We show in Fig. lA. II plots of RATE versus crpoisim for the 
single and composite bands, along with the best linear fit model 
(RATE= LI X crpoisim), for sources having 8 < L < 20 in each 
band. The best fit LI along with the values of cutrad for each 
band are given in Table lA. f| From the number of sources in each 
fit and the values of the it is clear that the fits are quite good, 
and more sophisticated models are not needed. The values of LI 
are all within ~10% of 1, implying that the correction is small (as 
expected), but from the improvement of the;^^^ values from using 
LI = 1 (which is equivalent to RATE= crpoisim), we conclude 
that it is also highly significant. 

The recipe for creating a sensitivity map is then: 

1 . Create background and exposure maps 

2. Chose a likelihood value L for the significance of the detec- 
tions. This recipe has only been tested for 8 < L < 20, but it 
could be in principle valid for likelihood values outside this 
interval. 

3. Choose a source extraction radius cutrad appropriate for the 
band you are interested in (Table IaTTT i 

4. For each pixel {X, Y) in your input image: 

(a) Calculate the sum of the values of the pixels in the back- 
ground map whose centres are within cutrad of {X, Y): 
bgdim 



Table A.l. Summary of the results of the Unear fits of crpoisim 
to RATE: Band is the band used for the fit, cutrad is the source 
extraction radius (in units of 4" pixels), LI is the best fit mul- 
tiplying constant, is the number of sources in the fit, x^j is 
the value of the best linear fit, while Xq is the x^ of the fit to 
LI = I, and P(F) the F-test probability of letting LI + 1 not 
being a significant improvement in the fit 



Band 


cutrad 


LI 


N 


Xu 


Xt, 


P{F) 




(pixels) 












1 


5.12 


1.14 


284 


104.5 


185.5 


< 10"" 


2 (soft) 


5.08 


1.10 


653 


140.1 


226.0 


< 10-* 


3 


5.15 


1.14 


414 


105.7 


208.4 


< 10-*^ 


4 (ultra-hard) 


5.49 


1.14 


127 


25.7 


56.2 


< 10-* 


5 


5.86 


1.15 


16 


1.7 


5.5 


< 10-* 


9 (XID) 


5.04 


1.02 


722 


141.0 


145.2 


< 10-^ 


3-5 (hard) 


5.18 


0.89 


243 


176.2 


237.8 


< 10-* 



(b) Calculate the average of the values of the pixels in the 
exposure map whose centres are within cutrad of {X, Y)\ 
expim 

(c) Find ctspoisim such that -\og(Phgdimi> (ctspoisim + 
bgdim))) — L 

(d) Calculate sensiX, Y) — ctspoisim/expimxlA. 

5. sensiX, Y) is our empirical estimate of the typical count rate 
of a source at {X, Y) detected with likelihood L 



Appendix B: Evaluating the count rate spectral fit 

To check the validity of our procedure we have performed both 
internal and external tests. The simplest internal check is to sim- 
ulate sources with a single spectral slope (F = 2 in our case), 
fit them, and compare the output slope values to the input value. 
We have kept the total (0.5-10 keV) count rates of the sources 
fixed to the observed count rates, and simulated the count rates 
in each band using a Poisson deviate from the expected num- 
ber of counts for the fixed slope and the exposure times Texp in 
Table [1] The eiTor on the simulated count rates (AC/?) were a 
bit more delicate to estimate. Using simply ACR - ^[CRJY^ 
would not be adequate, because it does not include the contribu- 
tion from the background subtraction, and because the statistics 
of the source detection by emldetect are very close to, but not 
exactly, Poissonian. We have used the selected sources to look 
for an empirical relationship between the observed ACR and the 
naive ^JCWJY^. We have found that a linear relationship is a 
good visual fit, with the parameters given in Table IB. 11 This is 
the recipe we have used to simulate the errors in the count rates 
in our simulations. The weighted averages of the fitted slopes 
are given in Table IB. 21 showing that any internal deviations are 
small (< 0.1), and that there are no obvious biases for sources 
selected in diff'erent bands. We have repeated the simulations for 
F = 1.8 and T = 1.7, with similar results and conclusions. 

We have also performed an external check against the sin- 
gle power law fits to the "properly" extracted spectra by Mateos 
et al. (2005). In that work the fits were done using xspec with 
response matrices and effective areas generated for each source 
with the corresponding SAS tasks. Since the Mateos et al. (2005) 
spectra were extracted between 0.2 and ~12 keV, we have fit- 
ted our sources using bands 2, 3, 4, and 5. In Fig. IB. 11 we show 
their best fit single power law slopes vs. our best fit slopes for 
the common accepted sources (a total of 1 143 sources, which 



24 



F.J. Carrera et al.: The XMM-Newton serendipitous survey' 



d 




5x10"* 10"^ 2x10"' 5x10"' 0.01 0.02 5x10"* 10"' 2x10"' 5x10" 



crpoisim crpoisim 




5x10"* 10"' 2x10"' 5x10"' 0.01 0.02 5x10"* 10"' 2x10"' 5x10" 



crpoisim crpoisim 




crpoisim 



Fig. A.l. RATE vs. crpoisim for all bands. Also shown are the best linear fits. 
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Table B.l. Empirical relationship between the observed errors 
on the count rates ACR and the naive ^^JCWJY^. We give the 
parameters a and of a linear relation ACR - ax ^JCR/Tg^p + b 



band 


a 


b 


2 


1.597 




3 


1.419 


3.824 X 10-5 


4 


1.452 


6.166 X 10-5 


5 


2.350 


1.184 X 10-* 




5 10 



Fig. B.l. Full band power-law slope from Mateos et al. (2005) 
vs. our 0.5-10 keV slope for the common accepted sources. The 
soUd Une shows the 1 : 1 relation. 



include some sources fitted by Mateos et al. but not included 
in their paper, Mateos private communication). The best fit 
proportionality constant using errors in both the X and Y axis 
is 0.980±0.003. This result makes us confident in our fitting 
method, showing that we haven't missed significant effects that 
were not already corrected in the emldetect count rates. A 
number of sources deviate appreciably from the 1 : 1 relation in 
Fig. IB. II the eff'ect being more noticeable to the eye for values 
of ro.2-i2keV between about 1.7 and 3. Within that interval, the 
number of sources with Foj-iokev > 3 is only 14% of the total 
number of sources in that interval, this simple test already indi- 
cates that the number of sources with pathological spectral fits is 
a small minority. We will discuss the influence of these sources 
in more detail below. We show in Fig. |B.2| our best fit slopes ver- 
sus ourO.5-12 keV countrates. There are some sources with very 
steep fitted slopes (because they have only been significantly de- 
tected in one band), but the bulk of our sources have reasonably 
"standard" spectral slopes, with no significant trends towards too 
steep or too faint slopes at lower count rates. 

Individually the fits are therefore good, but we have also as- 
sessed whether there are significant biases in the average slope, 
and whether the error bars on the slopes are adequate. We have 
first studied the distribution of the differences between the slopes 
divided by the quadratic sum of the errors (which would be ex- 
pected to follow a A^(0, 1) distribution in the Gaussian case). 
There is a small bias in these "normalised" differences of ~20% 
of the combined error bar (such us we tend to find steeper slopes 
than Mateos et al.), with a significant dispersion above the ex- 
pected value. Fitting a Gaussian to this distribution (which is not 
a Gaussian) finds a similar central value, but a much smaller dis- 
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Fig. B.2. Our 0.5-10 keV slope versus the 0.5 to 12 count rate 
for our 1433 sources. 

Table B.2. Number of sources selected in different bands A^, 
weighted average slopes (F) and errors (taking into account both 
the error bars in the individual F and the dispersion around the 
mean), and number of sources used in the average A^ave, for the 
simulated sources (see Section|3]l. The soft, hard, XID and ultra- 
hard bands are as defined in the text. "Soft and hard" refers 
to sources selected simultaneously in the soft and hard bands, 
"Only soft" refers to sources selected in the soft band but not in 
the hard band, and "Only hard" refers to sources selected in the 
hard band and not in the soft band 



0.5-2 keV 2-10 keV 



Selection 


N 


^ :ive 


<r> 




<r> 


Soft 


1267 


1267 


2.008±0.004 


1259 


2.029±0.007 


Hard 


397 


397 


2.008±0.005 


396 


2.039±0.010 


XID 


1359 


1359 


2.007±0.004 


1352 


2.030±0.007 


Ultra-hard 


91 


91 


2.014±0.008 


91 


2.019±0.013 


Soft and hard 


345 


345 


2.009±0.005 


345 


2.038±0.010 


Only soft 


922 


922 


2.007±0.006 


914 


2.005+0.013 


Only hard 


52 


52 


1.99 ±0.03 


51 


2.10 ±0.05 



persion, indicating that much of the excess variance is due to a 
limited number of significant outliers. 

Assuming Gaussian statistics, the error on the mean can be 
obtained dividing the standard deviation by the square root of the 
number of sources (~1000 in our case). If we perform this calcu- 
lation for our normalised differences, the value is much smaller 
than the ~20% relative bias above, which means that the bias is 
significant (many times the "error on the mean"), despite its low 
relative value. 

Studying the distribution of the absolute differences between 
the slopes (without dividing by the combined error bar), again 
there is a bias towards softer slopes in our sample of AF ~0.12, 
with a very significant dispersion, which again is much reduced 
in the Gaussian fit. Our conclusion is similar to above. 

If we compare our weighted average slopes with Mateos et 
al, there is also a difference, but in the sense of our slopes being 
flatter than theirs. Since the weighted average essentially dis- 
cards the sources with large error bars, this means that some of 
the outliers discussed above are soft sources with large error bars 
which bias the arithmetic average towards softer slopes. 
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